FSD4041 PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023
The dataset is (D) available only by permission from the data depositor/creator.
Study title
PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023
Dataset ID Number
FSD4041
Persistent identifiers
https://urn.fi/urn:nbn:fi:fsd:T-FSD4041
https://doi.org/10.60686/t-fsd4041
Data Type
Quantitative
Authors
- Kachel, Sven (University of Helsinki)
- Posio, Pekka (University of Helsinki)
Abstract
The PsyCoLaGe data, collected in Spain and Mexico, are part of the research project 'Gender, Society and Language Use: Evidence from Mexico and Spain' and contain the project's psychological data. The project also collected the CoLaGe corpus, which has been archived in the Language Bank of Finland (for further information, see 'Other Material'). The PsyCoLaGe data can be used to examine, for example, various research questions related to gender, gender roles and gender stereotypes, and they can additionally be used together with the corpus.
The data were used to examine respondents' attitudes and perceptions primarily regarding gender and partly sexuality. This was carried out using various instruments measuring psychological characteristics, which assessed, for example, respondents' views on gender role ideologies, attitudes towards gay men, behaviour-based gender role self-concept, and perceptions of the relationship between voice characteristics and gender. The modes of measurement varied. Respondents were asked, for example, to evaluate whether the sentences presented in the questionnaire were uttered by women or men, whether certain voice or personality traits were typical of women, men or both genders, and whether they agreed or disagreed with statements concerning gender.
The variable labels in the archived dataset include information on which instrument measuring psychological characteristics each variable belongs to. Some value labels of the variables include annotations indicating whether the voice features, sentences, and descriptions of everyday tasks presented to respondents are female‑typical or male‑typical, and whether the presented statement is egalitarian or traditional. These annotations are based on the instruments used to measure psychological characteristics.
The full questions, question preambles, and response options can be found in the questionnaire (quF4041_mul.pdf). The questions were presented to respondents either in Peninsular Spanish or Mexican Spanish depending on their nationality, not in English.
The background variables of the dataset are nationality, gender, sexual orientation and a dichotomous age group.
Keywords
attitudes; femininity; gender expression; gender role; homosexuality; masculinity; personality; sex; speech
Topic Classification
- Social sciences (Fields of Science Classification)
- Humanities (Fields of Science Classification)
- Gender and gender roles (CESSDA Topic Classification)
- Social behaviour and attitudes (CESSDA Topic Classification)
Series
Individual datasets
Distributor
Finnish Social Science Data Archive
Access
The dataset is (D) available only by permission from the data depositor/creator.
Data Collector
- Kachel, Sven (University of Helsinki)
- Posio, Pekka (University of Helsinki)
- Uclés Ramada, Gloria (University of Helsinki)
- González Guzmán, Grecia (University of Guadalajara)
Funders
- Kone Foundation (202007066)
Time Period Covered
2022 – 2023
Collection Dates
2022-02 – 2023-06
Nation
Spain, Mexico
Geographical Coverage
Valencia, Guadalajara
Analysis/Observation Unit Type
Individual
Universe
The adult population of Valencia and Guadalajara
Time Method
Cross-section
Sampling Procedure
Non-probability: Availability
Non-probability: Respondent-assisted
Non-probability: Purposive
Respondents were initially recruited through social media. To achieve a sufficiently large sample size, additional respondents were recruited through the researchers' personal contacts (excluding friends and relatives). In addition, respondents who had already been reached recruited further participants (i.e. snowball sampling).
Collection Mode
Self-administered questionnaire: Web-based (CAWI)
Psychological measurements and tests
Research Instrument
Structured questionnaire
Data File Language
The downloaded data package may contain different language versions of the same files.
The data files of this dataset are available in the following languages: multilingual.
FSD translates quantitative data into English on request, free of charge.
Data Version
1.0
Completeness of Data and Restrictions
For confidentiality reasons, the researchers anonymised the data by removing variables describing the respondent's level of education, main activity, marital status, number of children, childhood region, size of childhood region, and the country of residence at the time of data collection. For the same reason, variables indicating the respondent's age and sexual orientation were recoded into broader categories. In addition, the responses of four participants were removed from the archived dataset to ensure their anonymity.
Additional items that are not part of the original scale have been added to the BSRI scale in this dataset.
The last six characters of the variable Informant_ID, consisting of four letters and two digits, were generated randomly and form the respondent ID. The other characters in the variable indicate the respondent's corpus, gender, and age group.
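A minimal sketch of how the Informant_ID description above could be applied when preparing the data, assuming the variable is read as a plain string. The function name `split_informant_id` and the sample value in the usage note are hypothetical. The layout of the leading characters (corpus, gender, age group) is not specified in this record, so the prefix is returned undecoded, and no order of letters and digits within the respondent ID is assumed.

```python
from typing import Tuple


def split_informant_id(informant_id: str) -> Tuple[str, str]:
    """Split an Informant_ID string into (prefix, respondent_id).

    Assumption: the respondent ID is always exactly the last six characters.
    The prefix encodes corpus, gender and age group, but its exact layout is
    not documented here, so it is returned as-is.
    """
    if len(informant_id) <= 6:
        raise ValueError("expected a prefix followed by a six-character respondent ID")

    prefix, respondent_id = informant_id[:-6], informant_id[-6:]

    # The record states the respondent ID has four letters and two digits;
    # only the counts are checked, not their positions.
    letters = sum(ch.isalpha() for ch in respondent_id)
    digits = sum(ch.isdigit() for ch in respondent_id)
    if letters != 4 or digits != 2:
        raise ValueError("respondent ID should contain four letters and two digits")

    return prefix, respondent_id
```

For a made-up value such as `"XX_abcd12"`, the function would return the prefix `"XX_"` and the respondent ID `"abcd12"`.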
The dataset does not include the original date variable, as it did not function as intended across all file formats. Two new variables, Recording_time_year and Recording_time_month, have been created in the archived dataset based on the original variable. Together, these variables contain the same information as the original variable.
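The two variables described above can be recombined into a single month-level value. The sketch below is illustrative only: it assumes that Recording_time_year and Recording_time_month hold numeric year and month values (their storage format is not stated in this record), and the helper name `recording_period` is hypothetical. The range check uses the collection dates given in this record (2022-02 to 2023-06).

```python
from datetime import date

# Collection dates as stated in this record, at month precision.
COLLECTION_START = date(2022, 2, 1)
COLLECTION_END = date(2023, 6, 1)


def recording_period(year: int, month: int) -> date:
    """Combine year and month into a date set to the first day of that month.

    Assumption: both inputs are numeric. The day component is a placeholder,
    since the archived variables carry no day-level information.
    """
    period = date(int(year), int(month), 1)
    if not COLLECTION_START <= period <= COLLECTION_END:
        raise ValueError("recording month falls outside the stated collection dates")
    return period
```

Formatting the result with `strftime("%Y-%m")` gives a compact year-month label that does not suggest day-level precision.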
Weighting
There are no weight variables in the data.
Citation Requirement
The data and their creators shall be cited in all publications and presentations for which the data have been used. The bibliographic citation may be in the form suggested by the archive or in the form required by the publication.
Bibliographical Citation
Kachel, Sven (University of Helsinki) & Posio, Pekka (University of Helsinki): PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023 [dataset]. Data version 1.0 (2026-02-24). Finnish Social Science Data Archive [distributor]. DOI: https://doi.org/10.60686/t-fsd4041; URN: https://urn.fi/urn:nbn:fi:fsd:T-FSD4041
Deposit Requirement
Notify FSD of all publications where you have used the data by sending the citation information to user-services.fsd@tuni.fi.
Special Terms and Conditions for Access
The research group collected both linguistic and psychological survey data. The linguistic corpus data are archived at the Language Bank of Finland and the psychological survey data with FSD. The research group exclusively holds a master key document linking informant codes across the two datasets; this document is available only from the authors of the dataset upon reasonable request.
Disclaimer
The original data creators and the archive bear no responsibility for any results or interpretations arising from the reuse of the data.
Other Material
See downloadable files at the top of the page.
The corpus was collected from the same respondents who participated in the FSD4041 dataset archived at the Finnish Social Science Data Archive: PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023.
Related Materials
PsyCoLaGe: Data and Measurement Description. Included as an appendix to the dataset.
Related Publications

FSD:n aineistokuvailut (FSD metadata records) by Suomen yhteiskuntatieteellinen tietoarkisto (Finnish Social Science Data Archive) are licensed under a Creative Commons 1.0 Universal (CC0 1.0) license.