Skip to content
Change the repository type filter

All

    Repositories list

    • ArgoCD config deployment manifests for Torchlite
      0011Updated Jan 21, 2025Jan 21, 2025
    • Backend API service for Torchlite web dashboard
      Python
      02102Updated Jan 21, 2025Jan 21, 2025
    • Hackathon Handbook
      Jupyter Notebook
      1130Updated Jan 17, 2025Jan 17, 2025
    • scwared

      Public
      Home for general documentation about HTRC’s Mellon-funded SCWAReD project.
      0200Updated Jan 12, 2025Jan 12, 2025
    • Torchlite web interface
      TypeScript
      01182Updated Jan 6, 2025Jan 6, 2025
    • Java
      0101Updated Jan 3, 2025Jan 3, 2025
    • Tools for working with HTRC Feature Extraction files
      Python
      123992Updated Dec 28, 2024Dec 28, 2024
    • API access to aggregated EF data for Torchlite
      Scala
      0001Updated Dec 19, 2024Dec 19, 2024
    • Documentation for the TORCHLITE application
      0000Updated Dec 16, 2024Dec 16, 2024
    • Scala
      0001Updated Nov 15, 2024Nov 15, 2024
    • Utility for combining sequence files
      Scala
      0001Updated Nov 14, 2024Nov 14, 2024
    • Tool for extracting files out of sequence files
      Scala
      0001Updated Nov 14, 2024Nov 14, 2024
    • 0000Updated Sep 23, 2024Sep 23, 2024
    • Extracted Features API service
      Scala
      0013Updated Jul 17, 2024Jul 17, 2024
    • Jupyter notebooks demonstrating features of Torchlite
      Jupyter Notebook
      MIT License
      0016Updated Jul 6, 2024Jul 6, 2024
    • Web app for browsing HathiTrust BW.
      Python
      3005Updated Jul 5, 2024Jul 5, 2024
    • Java
      0002Updated Jun 26, 2024Jun 26, 2024
    • Python
      Apache License 2.0
      41100Updated Jun 13, 2024Jun 13, 2024
    • Informational site for HTRC’s 2024 TORCHLITE Hackathon event
      0000Updated May 28, 2024May 28, 2024
    • Jupyter notebook for viewing and analyzing publication information with HTRC TORCHLITE data and APIs.
      Jupyter Notebook
      0000Updated May 25, 2024May 25, 2024
    • handbook

      Public
      Editable files for TORCHLITE Handbook
      JavaScript
      1000Updated May 23, 2024May 23, 2024
    • 0000Updated May 21, 2024May 21, 2024
    • Extracts features (token counts, POS tags, etc.) from a list of HT volumes, to aid in non-consumptive research.
      Scala
      0201Updated May 17, 2024May 17, 2024
    • Used to convert enriched BIBFRAME-XML to HTRC metadata JSONLD
      XSLT
      0001Updated May 17, 2024May 17, 2024
    • Used to extract entities from the BIBFRAME-XML for purposes of enrichment from external sources
      Scala
      0001Updated May 17, 2024May 17, 2024
    • Utility library that can be used for performing header/body/footer identification over a set of pages from a volume.
      Scala
      0001Updated May 17, 2024May 17, 2024
    • Used to perform lookup (resolve) entities via external sources like VIAF, LOC, and WorldCat
      Scala
      0001Updated May 17, 2024May 17, 2024
    • Library that adds useful error handling and non-serializable object management capabilities to Apache Spark applications.
      Scala
      0101Updated May 17, 2024May 17, 2024
    • Searches Hathifiles for volumes matching given author, title pairs
      Scala
      0001Updated May 17, 2024May 17, 2024
    • Set of utility functions and routines that reduce the boilerplate needed to accomplish some common tasks in Scala.
      Scala
      0101Updated May 17, 2024May 17, 2024