FSD4041 PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023

Aineiston nimi

PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023

Aineistonumero

FSD4041

Pysyvät tunnisteet

https://urn.fi/urn:nbn:fi:fsd:T-FSD4041
https://doi.org/10.60686/t-fsd4041

Aineiston laatu

Kvantitatiivinen aineisto

Tekijät

Sisällön kuvaus

The PsyCoLaGe data collected in Spain and Mexico is part of the research project 'Gender, Society and Language Use: Evidence from Mexico and Spain” and contains the project's psychological data. The project also collected the CoLaGe corpus, which has been archived in the Language Bank of Finland (for further information, see 'Other Material”). The PsyCoLaGe data can be used to examine, for example, various research questions related to gender, gender roles and gender stereotypes, and it can additionally be used together with the corpus.

The data were used to examine respondents' attitudes and perceptions primarily regarding gender and partly sexuality. This was carried out using various instruments measuring psychological characteristics, which assessed, for example, respondents' views on gender role ideologies, attitudes towards gay men, behaviour-based gender role self-concept, and perceptions of the relationship between voice characteristics and gender. The modes of measurement varied. Respondents were asked, for example, to evaluate whether the sentences presented in the questionnaire were uttered by women or men, whether certain voice or personality traits were typical of women, men or both genders, and whether they agreed or disagreed with statements concerning gender.

The variable labels in the archived dataset include information on which instrument measuring psychological characteristics each variable belongs to. Some value labels of the variables include annotations indicating whether the voice features, sentences, and descriptions of everyday tasks presented to respondents are female‑typical or male‑typical, and whether the presented statement is egalitarian or traditional. These annotations are based on the instruments used to measure psychological characteristics.

The full questions, question preambles, and response options can be found in the questionnaire (quF4041_mul.pdf). The questions were presented to respondents either in Peninsular Spanish or Mexican Spanish depending on their nationality, not in English.

The background variables of the dataset are nationality, gender, sexual orientation and a dichotomous age group.

Asiasanat

attitudes; femininity; gender expression; gender role; homosexuality; masculinity; personality; sex; speech

Tieteenala/Aihealue

Sarja

Individual datasets

Jakelija

Finnish Social Science Data Archive

Käyttöoikeudet

The dataset is (D) available only by permission from the data depositor/creator.

Kerääjät

  • Kachel, Sven (University of Helsinki)
  • Posio, Pekka (University of Helsinki)
  • Uclés Ramada, Gloria (University of Helsinki)
  • González Guzmán, Grecia (University of Guadalajara)

Rahoittajat

  • Kone Foundation (202007066)

Ajallinen kattavuus

2022 – 2023

Aineistonkeruun ajankohta

2022-02 – 2023-06

Maa

Spain, Mexico

Kohdealue

Valencia, Guadaljara

Havaintoyksikkötyyppi

Individual

Perusjoukko/otos

The adult population of Valencia and Guadalajara

Tutkimuksen aikaulottuvuus

Cross-section

Otantamenetelmä

Non-probability: Availability

Non-probability: Respondent-assisted

Non-probability: Purposive

Respondents were initially recruited through social media. To achieve a sufficiently large sample size, additional respondents were recruited through the researchers' personal contacts (excluding friends and relatives). In addition, respondents who had already been reached recruited further participants (i.e. snowball sampling).

Keruumenetelmä

Self-administered questionnaire: Web-based (CAWI)

Psychological measurements and tests

Keruuväline tai –ohje

Structured questionnaire

Datatiedostojen kieli

Aineistopaketti voi sisältää samoja tiedostoja eri kielisinä.

Aineisto sisältää datatiedostoja seuraavilla kielillä: monikielinen.

Tietoarkisto kääntää kvantitatiivisia datatiedostoja englanniksi. Lisätietoja käännöspyynnön jättämisestä.

Datan versio

1.0

Aineiston käytössä huomioitavaa

For confidentiality reasons, the researchers anonymised the data by removing variables describing the respondent's level of education, main activity, marital status, number of children, childhood region, size of childhood region, and the country of residence at the time of data collection. For the same reason, variables indicating the respondent's age and sexual orientation were recoded into broader categories. In addition, the responses of four participants were removed from the archived dataset to ensure their anonymity.

Additional items that are not part of the original scale have been added to the BSRI scale in this dataset.

The last six characters of the variable Informant_ID, consisting of four letters and two digits, were generated randomly and form the respondent ID. The other characters in the variable indicate the respondent's corpus, gender, and age group.

The dataset does not include the original date variable, as it did not function as intended across all file formats. Two new variables, Recording_time_year and Recording_time_month, have been created in the archived dataset based on the original variable. Together, these variables contain the same information as the original variable.

Painokertoimet

There are no weight variables in the data.

Viittausvaatimus

The data and its creators shall be cited in all publications and presentations for which the data have been used. The bibliographic citation may be in the form suggested by the archive or in the form required by the publication.

Malliviittaus

Kachel, Sven (University of Helsinki) & Posio, Pekka (University of Helsinki): PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023 [dataset]. Data version 1.0 (2026-02-24). Finnish Social Science Data Archive [distributor]. DOI: https://doi.org/10.60686/t-fsd4041; URN: https://urn.fi/urn:nbn:fi:fsd:T-FSD4041

Julkaisusta tiedottaminen

Notify FSD of all publications where you have used the data by sending the citation information to user-services.fsd@tuni.fi.

Erityisehdot

The research group collected both linguistic and psychological survey data. The linguistic corpus data are archived at the Language Bank of Finland and the psychological survey data with FSD. The research group exclusively holds a master key document linking informant codes across the two datasets; this document is available only from the authors of the dataset upon reasonable request.

Varaumat

The original data creators and the archive bear no responsibility for any results or interpretations arising from the reuse of the data.

Muu materiaali

Katso ladattavat tiedostot sivun ylälaidasta.

A speech and text corpus archived in the Language Bank of Finland for research on language and gender in Mexico and Spain

The corpus was collected from the same respondents who participated in the FSD4041 dataset archived at the Finnish Social Science Data Archive: PsyCoLaGe: Psychological Data for the CoLaGe Corpus from Mexico and Spain 2022-2023.

Käytön ja kuvailun oheismateriaalit

PsyCoLaGe: Data and Measurement Description. Included as an appendix to the dataset.

Julkaisut aineistosta Tooltip

Posio, P., *Kachel, S., & Uclés Ramada, G. (2024). Morphosyntactic stereotypes of speakers with different genders and sexual orientations: An experimental investigation. Linguistics, online first. https://doi.org/10.1515/ling-2022-0143

Posio, P., Kachel, S., & Uclés Ramada, G. (2025). Sociolinguistic and functional variation in the use of direct reported speech in Spanish in the corpus CoLaGe-Valencia. Spanish in Context, online first. https://doi.org/10.1075/sic.24023.pos

Uclés Ramada, G., Kachel, S., & Posio, P. J. (2025). Conflict, gender, and amount of talk: Gender differences in Spanish role play data. Pragmatics and Society. https://doi.org/10.1075/ps.23144.ucl

Aineiston kuvailu koneluettavassa DDI-C 2.5 -formaatissa

Creative Commons License
FSD:n aineistokuvailut (FSD metadata records), joiden tekijä on Suomen yhteiskuntatieteellinen tietoarkisto (Finnish Social Science Data Archive), on lisensoitu Creative Commons 1.0 Yleismaailmallinen (CC0 1.0)-lisenssillä.