Overview
ARCH is developed actively with input from users and their stakeholders. The current development roadmap is outlined below.
❓ Questions? Reach out to the ARCH team anytime at: arch [at] archive [dot] org. |
Roadmap
Planned |
Explanation |
Example |
Computer vision jobs for dataset generation |
Integration with the Internet Archive’s Wayback Machine collection |
Integration with Vault collections |
Multiple OCR models |
Speech to text dataset generation |
Collection sourcing tools for account administrators |
Web archive transformation (WAT) and named entity dataset format improvements |
Additional named entity dataset languages |
Open source API |
Additional Jupyter Notebooks for exploratory data analysis |
Custom collection filtering and faceting capabilities |
|
Customization tools for additional collection types |
Multi-institutional, digital collection ingest |
|
More information
To learn more and contribute to ARCH development under the AGPL-3.0 license, see: ARCH Github repository.
Comments
0 comments
Please sign in to leave a comment.