- 
    
      Sentiment and Emotion Armenian LexiconsData for Armenian BERT models. All datasets and code can be accessed at ArmenianNLP Github page
- 
    
      Eastern Armenian National Corpus SubcorpusSubcorpus of EANC with various filters (authors, titles, genres, prose/poetry, original/translated, classical/new orthography
- 
    
      Armenia-related Books in the HathiTrust Digital LibraryCatalogue of all texts available in the libraries cooperating with HathiTrust. The keywords are 'Armenia' and 'Armenian', the database contains links to each book page and...
- 
    
      All Unicode Armenian FontsFree and non-commercial Armenian fonts
- 
    
      Armenian legislation database from ARLISArmenia legislation database extracted from the ARLIS website (arils.am) with all metadata and texts of Armenian laws and other legal documents. The dataset is relatively big,...
- 
    
      Eastern Armenian National Corpus Electronic LibraryThe corpus contains 4547379 words from 104 books of 12 authors
- 
    
      National Library of Armenia repository REST APIREST API of the DSpace installation of National Library of Armenia repository.
- 
    
      ARPA Armenian Paraphrase CorpusSentential paraphrase detection train, test datasets as well as BERT-based models for the Armenian language.
- 
    
      Armenian summary datasetarmsummary dataset from Hugging Face
- 
    
      pioNER - named entity annotated datasetspioNER corpus provides gold-standard and automatically generated named-entity datasets for the Armenian language. Published under Apache 2.0 license
- 
    
      Armenian wikipedia (hywiki) XML dumpsDumps of the Armenian wikipedia provided by Wikimedia foundation. Available as gzipped XML files
- 
    
      Armenian language dataset from CC-100, monolingual Datasets from Web Crawl DataArmenian language dataset extracted from CC-100 research dataset Description from website This corpus is an attempt to recreate the dataset used for training XLM-R. This corpus...