Skip to content

User stories and use cases

Erick edited this page Sep 10, 2015 · 1 revision

Here are a variety of things that users should be able to do with Tethne. This is not a complete list, but should provide a starting-point.

Parse bibliographic data

  • Parse bibliographic records downloaded from the ISI Web of Science in the "field tagged"/"plain text" format.
  • Parse bibliographic records and N-grams downloaded from the JSTOR Data-for-Research Portal
  • Parse bibliographic records in Zotero RDF format
    • Extract full-text from PDF, HTML, or other files attached to Zotero RDF data

Combine bibliographic datasets

  • Combine WoS data (with cited references) with JSTOR DfR data (with wordcounts) that describe the same documents (e.g. each resulting Paper has both cited references and wordcounts)
  • Combine WoS data (with cited references) with Zotero data
  • Combine Zotero data with JSTOR DfR data

Generate networks

  • Generate co-author networks, using data from WoS, DfR, and/or Zotero
  • Generate co-citation networks, using data from WoS
  • Generate bibliographic-coupling networks, using data from WoS
  • Generate networks of documents based on their shared topics
  • Generate networks of terms based on their shared topics
  • Generate networks of terms based on their co-occurrence in abstracts of Papers
  • Generate networks of terms based on their co-occurrence (as N-grams/Features) in papers
  • Generate networks of terms based on their co-occurrence in full-texts of papers
  • Generate networks of authors based on shared topics in their papers
  • Generate networks of journals based on shared topics in their papers
  • Generate networks of journals based on shared authors that publish in those journals

Topic modeling

  • Perform LDA topic modeling using N-grams (Features)
  • Perform LDA topic modeling using full-texts
  • Perform Latent Semantic Analysis using N-grams (Features)
  • Perform Latent Semantic Analysis using data from full-texts
  • Other kinds of text analysis?

Export networks

  • Export networks to GraphML
  • Export networks to RDF