Dataset - Data Catalog Armenia

ARPA Armenian Paraphrase Corpus

Train and test datasets for sentential paraphrase detection, along with BERT-based models, for the Armenian language.
- HTML
- CSV

Armenian summary dataset

The armsummary dataset from Hugging Face.
- HTML
- CSV

Armenian language dataset from CC-100: Monolingual Datasets from Web Crawl Data

Armenian language dataset extracted from the CC-100 research dataset. Description from the website: This corpus is an attempt to recreate the dataset used for training XLM-R. This corpus...
- HTML
- TXT

You can also access this registry using the API (see API Docs).

3 datasets found