Welcome to CodexAITeam, an open-source hub dedicated to the intersection of AI and cultural heritage. Our projects focus on developing innovative tools for the analysis, preservation, and digitization of historical texts and documents, with a current emphasis on Tibetan manuscripts.
Our mission is to bridge cutting-edge artificial intelligence with the needs of the cultural and historical research community. By creating open-source tools, we aim to make historical documents more accessible, foster collaborative innovation, and support academic and public research efforts.
Our current project is an AI-driven tool for analyzing the layout of complex Tibetan documents, such as leporellos and other folded manuscripts. Key features include:
- Layout detection and segmentation.
- Recognition and extraction of Tibetan page numbers.
- Integration with Kitodo, a platform widely used in libraries for digitization and annotation.
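To make the page-number feature concrete, here is a minimal sketch of one post-processing step: turning Tibetan numerals in already-recognized text into integers. It is not the project's implementation. It assumes an upstream OCR step has already produced a Unicode string, and the function name `extract_tibetan_page_numbers` is hypothetical.

```python
import re

# Tibetan digits occupy the Unicode range U+0F20 (zero) through U+0F29 (nine).
TIBETAN_DIGITS = re.compile(r"[\u0F20-\u0F29]+")


def extract_tibetan_page_numbers(text: str) -> list[int]:
    """Return every run of Tibetan digits found in `text` as an integer.

    Hypothetical helper for illustration only; `text` is assumed to be
    OCR output that already contains Unicode Tibetan characters.
    """
    # Python's int() accepts any Unicode decimal digit, Tibetan ones included.
    return [int(run) for run in TIBETAN_DIGITS.findall(text)]
```

A real pipeline would likely also need to locate the page-number region on the scan and handle misrecognized glyphs, which this sketch does not attempt.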
More projects coming soon!
Our work involves:
- AI and Machine Learning: Computer vision and NLP models.
- Data Processing: Tools for document segmentation and annotation.
- Open Source Libraries: Contributions to platforms like Kitodo.
We’re always looking for contributors passionate about:
- AI/ML development (computer vision, NLP).
- Digital Humanities and historical document analysis.
- Open-source software and collaborative research.
Whether you're a developer, researcher, or enthusiast, we'd love to have you on board. Get involved by:
- Exploring our repositories.
- Opening issues or feature requests.
- Contributing code, ideas, or feedback.