datasets

I have collaborated on a number of datasets that are freely accessible:


Estonian paradigms in phonemic notation: (with Sacha Beniamine)

LINK TO DATASET

This dataset includes all inflected forms of 5,000 nouns and 5,000 verbs in Estonian, presented in phonemic form.

Cite as:

Beniamine, Sacha. Aigro, Mari. Baerman, Matthew. Copot, Maria. (2023) Estonian Paradigms in Phonemic Notation. DOI 10.5281/zenodo.8383522

Publications:

Beniamine, Sacha. Aigro, Mari. Baerman, Matthew. Bouton, Jules. Copot, Maria. (in review) Eesthetic: A Paralex Lexicon of Estonian Paradigms.


Semantic ratings for 36,000 Estonian words: (with Mariann Proos)

LINK TO SEARCH ENGINE

LINK TO CSV DATASET

This dataset includes 8 different ratings for Estonian words, all based on human judgments collected between 2021-2023. Concreteness ratings cover 36,125 lemmas while affectivity (activation, valence) and modality ratings (seeing, hearing, smelling, tasting and touching) cover 439 lemmas. All are expressed on a scale from 0-10 and only words with a minimum of 9 ratings are included.

Cite as:

Proos, Mariann. Aigro, Mari. (2024) Concreteness, modality and affectiveness ratings for Estonian words. DOI 10.17605/OSF.IO/Y4BN5. Available at: https://osf.io/y4bn5/

Publications:

Proos, M. Aigro, M. (2023) Concreteness ratings for 36,000 Estonian words. Behavior Research Methods. Published online.

Aigro, M. Proos, M. (2022) Konkreetsushinnangute kogumine eestikeelsetele sõnadele [Collecting concreteness ratings for Estonian words]. Estonian Papers in Applied Linguistics 18: 5-20.


Homonyms in Estonian

LINK TO DATASET

This dataset includes 575 lemmas in Estonian, the reference form of which is homonymous (identical in orthographic form) with another lemma’s reference form. These lemmas are presented with two semantically distinct senses in the largest online reference dictionary Sõnaveeb (www.sonaveeb.ee). Reference form is defined as nominative singular for nouns, ma-infinitive for verbs and the only available form for words without inflection.

Cite as:

Aigro, Mari. (2022) Homonyms in Estonian. DOI 10.17605/OSF.IO/BYJVP. Available at: https://osf.io/byjvp/