FSD4041 PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023

The dataset is (D) available only by permission from the data depositor/creator.

Download the data

Study description in other languages

Related files

Study title

PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023

Dataset ID Number

FSD4041

Persistent identifiers

https://urn.fi/urn:nbn:fi:fsd:T-FSD4041
https://doi.org/10.60686/t-fsd4041

Data Type

Quantitative

Authors

Abstract

The PsyCoLaGe data collected in Spain and Mexico is part of the research project 'Gender, Society and Language Use: Evidence from Mexico and Spain” and contains the project's psychological data. The project also collected the CoLaGe corpus, which has been archived in the Language Bank of Finland (for further information, see 'Other Material”). The PsyCoLaGe data can be used to examine, for example, various research questions related to gender, gender roles and gender stereotypes, and it can additionally be used together with the corpus.

The data were used to examine respondents' attitudes and perceptions primarily regarding gender and partly sexuality. This was carried out using various instruments measuring psychological characteristics, which assessed, for example, respondents' views on gender role ideologies, attitudes towards gay men, behaviour-based gender role self-concept, and perceptions of the relationship between voice characteristics and gender. The modes of measurement varied. Respondents were asked, for example, to evaluate whether the sentences presented in the questionnaire were uttered by women or men, whether certain voice or personality traits were typical of women, men or both genders, and whether they agreed or disagreed with statements concerning gender.

The variable labels in the archived dataset include information on which instrument measuring psychological characteristics each variable belongs to. Some value labels of the variables include annotations indicating whether the voice features, sentences, and descriptions of everyday tasks presented to respondents are female‑typical or male‑typical, and whether the presented statement is egalitarian or traditional. These annotations are based on the instruments used to measure psychological characteristics.

The full questions, question preambles, and response options can be found in the questionnaire (quF4041_mul.pdf). The questions were presented to respondents either in Peninsular Spanish or Mexican Spanish depending on their nationality, not in English.

The background variables of the dataset are nationality, gender, sexual orientation and a dichotomous age group.

Keywords

attitudes; femininity; gender expression; gender role; homosexuality; masculinity; personality; sex; speech

Topic Classification

Series

Individual datasets

Distributor

Finnish Social Science Data Archive

Access

The dataset is (D) available only by permission from the data depositor/creator.

Data Collector

  • Kachel, Sven (University of Helsinki)
  • Posio, Pekka (University of Helsinki)
  • Uclés Ramada, Gloria (University of Helsinki)
  • González Guzmán, Grecia (University of Guadalajara)

Funders

  • Kone Foundation (202007066)

Time Period Covered

2022 – 2023

Collection Dates

2022-02 – 2023-06

Nation

Spain, Mexico

Geographical Coverage

Valencia, Guadaljara

Analysis/Observation Unit Type

Individual

Universe

The adult population of Valencia and Guadalajara

Time Method

Cross-section

Sampling Procedure

Non-probability: Availability

Non-probability: Respondent-assisted

Non-probability: Purposive

Respondents were initially recruited through social media. To achieve a sufficiently large sample size, additional respondents were recruited through the researchers' personal contacts (excluding friends and relatives). In addition, respondents who had already been reached recruited further participants (i.e. snowball sampling).

Collection Mode

Self-administered questionnaire: Web-based (CAWI)

Psychological measurements and tests

Research Instrument

Structured questionnaire

Data File Language

Downloaded data package may contain different language versions of the same files.

The data files of this dataset are available in the following languages: multilingual.

FSD translates quantitative data into English on request, free of charge. More information on ordering data translation.

Data Version

1.0

Completeness of Data and Restrictions

For confidentiality reasons, the researchers anonymised the data by removing variables describing the respondent's level of education, main activity, marital status, number of children, childhood region, size of childhood region, and the country of residence at the time of data collection. For the same reason, variables indicating the respondent's age and sexual orientation were recoded into broader categories. In addition, the responses of four participants were removed from the archived dataset to ensure their anonymity.

Additional items that are not part of the original scale have been added to the BSRI scale in this dataset.

The last six characters of the variable Informant_ID, consisting of four letters and two digits, were generated randomly and form the respondent ID. The other characters in the variable indicate the respondent's corpus, gender, and age group.

The dataset does not include the original date variable, as it did not function as intended across all file formats. Two new variables, Recording_time_year and Recording_time_month, have been created in the archived dataset based on the original variable. Together, these variables contain the same information as the original variable.

Weighting

There are no weight variables in the data.

Citation Requirement

The data and its creators shall be cited in all publications and presentations for which the data have been used. The bibliographic citation may be in the form suggested by the archive or in the form required by the publication.

Bibliographical Citation

Kachel, Sven (University of Helsinki) & Posio, Pekka (University of Helsinki): PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023 [dataset]. Data version 1.0 (2026-02-24). Finnish Social Science Data Archive [distributor]. DOI: https://doi.org/10.60686/t-fsd4041; URN: https://urn.fi/urn:nbn:fi:fsd:T-FSD4041

Deposit Requirement

Notify FSD of all publications where you have used the data by sending the citation information to user-services.fsd@tuni.fi.

Special Terms and Conditions for Access

The research group collected both linguistic and psychological survey data. The linguistic corpus data are archived at the Language Bank of Finland and the psychological survey data with FSD. The research group exclusively holds a master key document linking informant codes across the two datasets; this document is available only from the authors of the dataset upon reasonable request.

Disclaimer

The original data creators and the archive bear no responsibility for any results or interpretations arising from the reuse of the data.

Other Material

See downloadable files at the top of the page.

A speech and text corpus archived in the Language Bank of Finland for research on language and gender in Mexico and Spain

The corpus was collected from the same respondents who participated in the FSD4041 dataset archived at the Finnish Social Science Data Archive: PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023.

Related Materials

PsyCoLaGe: Data and Measurement Description. Included as an appendix to the dataset.

Related Publications Tooltip

Posio, P., *Kachel, S., & Uclés Ramada, G. (2024). Morphosyntactic stereotypes of speakers with different genders and sexual orientations: An experimental investigation. Linguistics, online first. https://doi.org/10.1515/ling-2022-0143

Posio, P., Kachel, S., & Uclés Ramada, G. (2025). Sociolinguistic and functional variation in the use of direct reported speech in Spanish in the corpus CoLaGe-Valencia. Spanish in Context, online first. https://doi.org/10.1075/sic.24023.pos

Uclés Ramada, G., Kachel, S., & Posio, P. J. (2025). Conflict, gender, and amount of talk: Gender differences in Spanish role play data. Pragmatics and Society. https://doi.org/10.1075/ps.23144.ucl

Study description in machine readable DDI-C 2.5 format

Creative Commons License
FSD:n aineistokuvailut (FSD metadata records) by Suomen yhteiskuntatieteellinen tietoarkisto (Finnish Social Science Data Archive) are licensed under a Creative Commons 1.0 Universal (CC0 1.0) license.