Textual analysis of scientific articles published on Colombian fossils
DOI:
https://doi.org/10.5007/1518-2924.2022.e83470Keywords:
Colombia, Iramuteq, Lexicon, PaleontologyAbstract
Objective: Identify the lexical proximities in a corpus of texts of scientific articles published in academic journals indexed in the Scopus database on Colombian fossils.
Method: This work applies textual analysis to five paleontological articles on Colombian fossils to identify lexical proximity in a corpus of texts. This work allowed us to determine: the grammatical categories, the proximity between categories of words and variables with the analysis of specificities (AE), the grouping of the words with the study of the descending hierarchical classification (CJD) and the graphic presentation of the words.
Results: The documentary corpus comprises 31,319- word occurrences, 1,450 active forms or specific words and 303 complimentary forms or common words. The grammatical category of nouns predominates (24%) and words not recognized in the dictionary (17%). The familiar words with the highest frequencies are articles, conjugations, propositions, and pronouns.
Conclusions: It was found that there is linguistic proximity between article 1 and the active forms of “Colombia” and article 2 and the active forms of “fossil”. The words were grouped into five classes, and the word cloud was created with 1271 words.
Downloads
References
BARCELLINI, Flore; DELGOULET, Catherine; NELSON, Julien. Are online discussions enough to constitute communities of practice in professional domain? a case study of ergonomics’ practice in France. Cognition, Technology & Work, v. 18, n. 2, p. 249-266, 2016.
BENZÉCRI, Jean Paul. L' Analyse des Correspondances. En : L'Analyse des Données, Tomo II. 2de. Éd. París: Dunod, 1976.
CÉSARI, Matilde Inés. Protocolo de análisis de datos textuales aplicados a la minería de textos. 1a edición para el alumno. Ciudad Autónoma de Buenos Aires: Universidad Tecnológica Nacional. Facultad Regional Mendoza, 2017.
CONTRERAS BARRERA, Marcial. Minería de texto en la clasificación de material bibliográfico. Biblios, n. 64, p. 33-43, 2016.
FEBLES RODRÍGUEZ, Juan Pedro; GONZÁLEZ PÉREZ, Abel. Aplicación de la minería de datos en la bioinformática. Acimed, v. 10, n. 2, p. 69-76, 2002.
FERREIRA, Márcio Henrique Wanderley; CORRÊA, Renato Fernandes. Estudo métrico sobre biblioteca digital: uso do software Iramuteq. En: ENCONTRO NACIONAL DE PESQUISA EM CIÊNCIA DA INFORMAÇÃO, 19, 2018, Londirna, Anais [...], Londrina: UEL, 2018.
GÁLVEZ, Carmen. Minería de textos: la nueva generación de análisis de literatura científica en biología molecular y genómica. Encontros Bibli: revista eletrônica de biblioteconomia e ciência da informação, v. 13, n. 25, p. 1-14, 2008.
IEZZI, D. F.; CELARDO, L. Text Analytics: Present, Past and Future. In INTERNATIONAL CONFERENCE ON THE STATISTICAL ANALYSIS OF TEXTUAL DATA (pp. 3-15). Cham: Springer, 2018.
IRAMUTEQ. Bienvenida. 2021. Disponible en: http://www.iramuteq.org/ Acceso en: 21 may. 2022.
JARAMILLO VALBUENA, Sonia; CARDONA, Sergio Augusto; FERNÁNDEZ, Alejandro. Minería de datos sobre streams de redes sociales, una herramienta al servicio de la Bibliotecología. Información, cultura y sociedad, n. 33, p. 63-74, 2015.
MARIÑELARENA-DONDENA, Luciana; ERRECALDE, Marcelo Luis; CASTRO SOLANO, Alejandro. Extracción de conocimiento con técnicas de minería de textos aplicadas a la psicología. Revista Argentina de Ciencias del Comportamiento, v. 9, n. 2, p. 65-76, 2017.
MORALES DEL RÍO, Cecilia. M. Uso de software lexical: una revisión comparativa. En: CONGRESO DE LA RED INTERNACIONAL DE INVESTIGADORES EN COMPETITIVIDAD, 13, 2019, Anais [...], 2019. Disponible en: https://riico.net/index.php/riico/article/view/1794 Acceso en: 21 may. 2022
PENG, T. Q.; ZHANG, L.; ZHONG, Z. J.; ZHU, J. J. Mapping the landscape of Internet studies: Text mining of social science journal articles 2000–2009. New Media & Society, v. 15, n. 5, p. 644-664, 2013.
REINERT, Max. Une méthode de classification descendante hiérarchique: application à l'analyse lexicale par contexte. Cahiers de l'Analyse des Données, v. 8, n. 2, p. 187-198, 1983.
REINERT, Max. Un logiciel d'analyse lexicale. Cahiers de l'Analyse des Données, v. 11, n. 4, p. 471-481, 1986.
REINERT, Max. Quelques aspects du choix des unités d'analyse et de leur contrôle dans la méthode Alceste. Journées Internationales d´Analyse Statistique des Données Textuelles (JADT), v. 1, p. 27-34, 1995.
REINERT, Max. Quel objet pour une analyse statistique du discours? Quelques réflexions à propos de la réponse Alceste. Journées Internationales d´Analyse Statistique des Données Textuelles (JADT). Lexicometrica, p. 557-569, 1998.
REINERT, Max. Mondes lexicaux stabilisés et analyse statistique de discours. Actes de la JADT. Journées Internationales d´Analyse Statistique des Données Textuelles (JADT). p. 981-993, 2008
RIZZOLI, Valentina. Histories of Social Psychology in Europe and North America, as Seen from Research Topics in Two Key Journals. In: Tracing the Life Cycle of Ideas in the Humanities and Social Sciences (pp. 65-86). Cham: Springer, 2018.
TAISE HOFFMANN, Yohana; BISSET ALVAREZ, Edgar; MARTÍ-LAHERA, Yohannis. Análise textual com IRaMuTeQ de pesquisas recentes em História da educação matemática no Brasil: um exemplo de Humanidades Digitais. Investigación bibliotecológica, v. 34, n. 84, p. 103-133, 2020.
URBIZAGÁSTEGUI ALVARADO, Rubén; RESTREPO ARANGO, Cristina. La ley de Zipf y el punto de transición de Goffman en la indización automática. Investigación bibliotecológica, v. 25, n. 54, p. 1-15, 2011.
YAN, B. N.; LEE, T. S.; LEE, T. P. Analysis of research papers on E-commerce (2000–2013): based on a text mining approach. Scientometrics, v. 105, n. 1, p. 403-417, 2015.
SALVADOR, Pétala Tuani Candido de Oliveira; GOMES, Andréa Tayse de Lima; RODRIGUES, Cláudia Cristiane Filgueira Martins; CHIAVONE, Flávia Barreto Tavares; ALVES, Kisna Yasmin Andrade; BEZERRIL, Manacés dos Santos; SANTOS, Viviane Euzébia Pereira. Uso do software iramuteq nas pesquisas brasileiras da área da saúde: uma scoping review. Revista Brasileira em Promoção da Saúde, n. 31, 2018.
SALVIATI, Maria Elisabeth. Manual do Aplicativo Iramuteq: (versão 0.7 Alpha 2 e R Versão 3.2.3). Planaltina: [Sin editor], 2017.
SAMPAIO, Ricardo B.; FONSECA, Bruna P.; BAHULKAR, Ashwin; SZYMANSKI, Boleslaw K. Network analysis to support public health: evolution of collaboration among leishmaniasis researchers. Scientometrics, n. 111, v. 3, 2001-2021, 2017.
SPOLAORE, Giuseppe; GIARETTA, Pierdaniele. Tracing the Words of the Analytic Turn in the Journal of Philosophy. In: Tracing the Life Cycle of Ideas in the Humanities and Social Sciences (pp. 25-44). Cham: Springer, 2018.
SOUZA, Marli Aparecida Rocha de; WALL, Marilene Loewen; THULER, Andrea Cristina de Morais Chave; LOWEN, Ingrid Margareth Voth; PERES, Aida Maris. O uso do software IRAMUTEQ na análise de dados em pesquisas qualitativas. Revista da Escola de Enfermagem da USP, v. 52, 2018.
ZANJIRCHI, Seyed Mahmoud; ABRISHAMI, Mina; JALILIAN, Negar. Four decades of fuzzy sets theory in operations management: application of life-cycle, bibliometrics and content analysis. Scientometrics, v. 119, n. 3, 1289-1309, 2019.
Published
How to Cite
Issue
Section
License
Copyright (c) 2022 Cristina Restrepo-Arango, Andrés L. Cárdenas-Rozo
This work is licensed under a Creative Commons Attribution 4.0 International License.
The author must guarantee that:
- there is full consensus among all the coauthors in approving the final version of the document and its submission for publication.
- the work is original, and when the work and/or words from other people were used, they were properly acknowledged.
Plagiarism in all of its forms constitutes an unethical publication behavior and is unacceptable. Encontros Bibli has the right to use software or any other method of plagiarism detection.
All manuscripts submitted to Encontros Bibli go through plagiarism and self-plagiarism identification. Plagiarism identified during the evaluation process will result in the filing of the submission. In case plagiarism is identified in a manuscript published in the journal, the Editor-in-Chief will conduct a preliminary investigation and, if necessary, will make a retraction.
This journal, following the recommendations of the Open Source movement, provides full open access to its content. By doing this, the authors keep all of their rights allowing Encontros Bibli to publish and make its articles available to the whole community.
Encontros Bibli content is licensed under a Creative Commons Attribution 4.0 International License.
Any user has the right to:
- Share - copy, download, print or redistribute the material in any medium or format.
- Adapt - remix, transform and build upon the material for any purpose, even commercially.
According to the following terms:
- Attribution - You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions - You may not apply legal terms or technological measures that legally restrict others from doing anything that the license permits.