-
Notifications
You must be signed in to change notification settings - Fork 32
User stories and use cases
Erick edited this page Sep 10, 2015
·
1 revision
Here are a variety of things that users should be able to do with Tethne. This is not a complete list, but should provide a starting-point.
- Parse bibliographic records downloaded from the ISI Web of Science in the "field tagged"/"plain text" format.
- Parse bibliographic records and N-grams downloaded from the JSTOR Data-for-Research Portal
- Parse bibliographic records in Zotero RDF format
- Extract full-text from PDF, HTML, or other files attached to Zotero RDF data
- Combine WoS data (with cited references) with JSTOR DfR data (with wordcounts) that describe the same documents (e.g. each resulting Paper has both cited references and wordcounts)
- Combine WoS data (with cited references) with Zotero data
- Combine Zotero data with JSTOR DfR data
- Generate co-author networks, using data from WoS, DfR, and/or Zotero
- Generate co-citation networks, using data from WoS
- Generate bibliographic-coupling networks, using data from WoS
- Generate networks of documents based on their shared topics
- Generate networks of terms based on their shared topics
- Generate networks of terms based on their co-occurrence in abstracts of Papers
- Generate networks of terms based on their co-occurrence (as N-grams/Features) in papers
- Generate networks of terms based on their co-occurrence in full-texts of papers
- Generate networks of authors based on shared topics in their papers
- Generate networks of journals based on shared topics in their papers
- Generate networks of journals based on shared authors that publish in those journals
- Perform LDA topic modeling using N-grams (Features)
- Perform LDA topic modeling using full-texts
- Perform Latent Semantic Analysis using N-grams (Features)
- Perform Latent Semantic Analysis using data from full-texts
- Other kinds of text analysis?
- Export networks to GraphML
- Export networks to RDF