Textual analysis of scientific articles published on Colombian fossils

Authors

DOI:

https://doi.org/10.5007/1518-2924.2022.e83470

Keywords:

Colombia, Iramuteq, Lexicon, Paleontology

Abstract

Objective: Identify the lexical proximities in a corpus of texts of scientific articles published in academic journals indexed in the Scopus database on Colombian fossils.

Method:  This work applies textual analysis to five paleontological articles on Colombian fossils to identify lexical proximity in a corpus of texts.  This work allowed us to determine:  the grammatical categories, the proximity between categories of words and variables with the analysis of specificities (AE), the grouping of the words with the study of the descending hierarchical classification (CJD) and the graphic presentation of the words.

Results: The documentary corpus comprises 31,319- word occurrences, 1,450 active forms or specific words and 303 complimentary forms or common words. The grammatical category of nouns predominates (24%) and words not recognized in the dictionary (17%). The familiar words with the highest frequencies are articles, conjugations, propositions, and pronouns.

Conclusions: It was found that there is linguistic proximity between article 1 and the active forms of “Colombia” and article 2 and the active forms of “fossil”. The words were grouped into five classes, and the word cloud was created with 1271 words.

Downloads

Download data is not yet available.

Author Biographies

Dr., Universidad de Córdoba (Colombia)División de Bibliotecas y Recursos Educativos

Doctor in Library Science and Information Studies from the National Autonomous University of Mexico and Master in Library Science from El Colegio de México. The lines of research that he develops are: bibliometrics, scientometrics, science evaluation and information organization. More than 10 years of work experience in academic libraries.

PhD., University EAFIT

He is a Geologist from the National University of Colombia in 2004, a Dr. in Geology from the University of South Florida, USA in 2012 and a PhD. in Paleobiology from the Smithsonian Research Institute
tropical trees in 2013. He is currently developing teaching activities in the Department of Earth Sciences of the EAFIT University.

References

BARCELLINI, Flore; DELGOULET, Catherine; NELSON, Julien. Are online discussions enough to constitute communities of practice in professional domain? a case study of ergonomics’ practice in France. Cognition, Technology & Work, v. 18, n. 2, p. 249-266, 2016.

BENZÉCRI, Jean Paul. L' Analyse des Correspondances. En : L'Analyse des Données, Tomo II. 2de. Éd. París: Dunod, 1976.

CÉSARI, Matilde Inés. Protocolo de análisis de datos textuales aplicados a la minería de textos. 1a edición para el alumno. Ciudad Autónoma de Buenos Aires: Universidad Tecnológica Nacional. Facultad Regional Mendoza, 2017.

CONTRERAS BARRERA, Marcial. Minería de texto en la clasificación de material bibliográfico. Biblios, n. 64, p. 33-43, 2016.

FEBLES RODRÍGUEZ, Juan Pedro; GONZÁLEZ PÉREZ, Abel. Aplicación de la minería de datos en la bioinformática. Acimed, v. 10, n. 2, p. 69-76, 2002.

FERREIRA, Márcio Henrique Wanderley; CORRÊA, Renato Fernandes. Estudo métrico sobre biblioteca digital: uso do software Iramuteq. En: ENCONTRO NACIONAL DE PESQUISA EM CIÊNCIA DA INFORMAÇÃO, 19, 2018, Londirna, Anais [...], Londrina: UEL, 2018.

GÁLVEZ, Carmen. Minería de textos: la nueva generación de análisis de literatura científica en biología molecular y genómica. Encontros Bibli: revista eletrônica de biblioteconomia e ciência da informação, v. 13, n. 25, p. 1-14, 2008.

IEZZI, D. F.; CELARDO, L. Text Analytics: Present, Past and Future. In INTERNATIONAL CONFERENCE ON THE STATISTICAL ANALYSIS OF TEXTUAL DATA (pp. 3-15). Cham: Springer, 2018.

IRAMUTEQ. Bienvenida. 2021. Disponible en: http://www.iramuteq.org/ Acceso en: 21 may. 2022.

JARAMILLO VALBUENA, Sonia; CARDONA, Sergio Augusto; FERNÁNDEZ, Alejandro. Minería de datos sobre streams de redes sociales, una herramienta al servicio de la Bibliotecología. Información, cultura y sociedad, n. 33, p. 63-74, 2015.

MARIÑELARENA-DONDENA, Luciana; ERRECALDE, Marcelo Luis; CASTRO SOLANO, Alejandro. Extracción de conocimiento con técnicas de minería de textos aplicadas a la psicología. Revista Argentina de Ciencias del Comportamiento, v. 9, n. 2, p. 65-76, 2017.

MORALES DEL RÍO, Cecilia. M. Uso de software lexical: una revisión comparativa. En: CONGRESO DE LA RED INTERNACIONAL DE INVESTIGADORES EN COMPETITIVIDAD, 13, 2019, Anais [...], 2019. Disponible en: https://riico.net/index.php/riico/article/view/1794 Acceso en: 21 may. 2022

PENG, T. Q.; ZHANG, L.; ZHONG, Z. J.; ZHU, J. J. Mapping the landscape of Internet studies: Text mining of social science journal articles 2000–2009. New Media & Society, v. 15, n. 5, p. 644-664, 2013.

REINERT, Max. Une méthode de classification descendante hiérarchique: application à l'analyse lexicale par contexte. Cahiers de l'Analyse des Données, v. 8, n. 2, p. 187-198, 1983.

REINERT, Max. Un logiciel d'analyse lexicale. Cahiers de l'Analyse des Données, v. 11, n. 4, p. 471-481, 1986.

REINERT, Max. Quelques aspects du choix des unités d'analyse et de leur contrôle dans la méthode Alceste. Journées Internationales d´Analyse Statistique des Données Textuelles (JADT), v. 1, p. 27-34, 1995.

REINERT, Max. Quel objet pour une analyse statistique du discours? Quelques réflexions à propos de la réponse Alceste. Journées Internationales d´Analyse Statistique des Données Textuelles (JADT). Lexicometrica, p. 557-569, 1998.

REINERT, Max. Mondes lexicaux stabilisés et analyse statistique de discours. Actes de la JADT. Journées Internationales d´Analyse Statistique des Données Textuelles (JADT). p. 981-993, 2008

RIZZOLI, Valentina. Histories of Social Psychology in Europe and North America, as Seen from Research Topics in Two Key Journals. In: Tracing the Life Cycle of Ideas in the Humanities and Social Sciences (pp. 65-86). Cham: Springer, 2018.

TAISE HOFFMANN, Yohana; BISSET ALVAREZ, Edgar; MARTÍ-LAHERA, Yohannis. Análise textual com IRaMuTeQ de pesquisas recentes em História da educação matemática no Brasil: um exemplo de Humanidades Digitais. Investigación bibliotecológica, v. 34, n. 84, p. 103-133, 2020.

URBIZAGÁSTEGUI ALVARADO, Rubén; RESTREPO ARANGO, Cristina. La ley de Zipf y el punto de transición de Goffman en la indización automática. Investigación bibliotecológica, v. 25, n. 54, p. 1-15, 2011.

YAN, B. N.; LEE, T. S.; LEE, T. P. Analysis of research papers on E-commerce (2000–2013): based on a text mining approach. Scientometrics, v. 105, n. 1, p. 403-417, 2015.

SALVADOR, Pétala Tuani Candido de Oliveira; GOMES, Andréa Tayse de Lima; RODRIGUES, Cláudia Cristiane Filgueira Martins; CHIAVONE, Flávia Barreto Tavares; ALVES, Kisna Yasmin Andrade; BEZERRIL, Manacés dos Santos; SANTOS, Viviane Euzébia Pereira. Uso do software iramuteq nas pesquisas brasileiras da área da saúde: uma scoping review. Revista Brasileira em Promoção da Saúde, n. 31, 2018.

SALVIATI, Maria Elisabeth. Manual do Aplicativo Iramuteq: (versão 0.7 Alpha 2 e R Versão 3.2.3). Planaltina: [Sin editor], 2017.

SAMPAIO, Ricardo B.; FONSECA, Bruna P.; BAHULKAR, Ashwin; SZYMANSKI, Boleslaw K. Network analysis to support public health: evolution of collaboration among leishmaniasis researchers. Scientometrics, n. 111, v. 3, 2001-2021, 2017.

SPOLAORE, Giuseppe; GIARETTA, Pierdaniele. Tracing the Words of the Analytic Turn in the Journal of Philosophy. In: Tracing the Life Cycle of Ideas in the Humanities and Social Sciences (pp. 25-44). Cham: Springer, 2018.

SOUZA, Marli Aparecida Rocha de; WALL, Marilene Loewen; THULER, Andrea Cristina de Morais Chave; LOWEN, Ingrid Margareth Voth; PERES, Aida Maris. O uso do software IRAMUTEQ na análise de dados em pesquisas qualitativas. Revista da Escola de Enfermagem da USP, v. 52, 2018.

ZANJIRCHI, Seyed Mahmoud; ABRISHAMI, Mina; JALILIAN, Negar. Four decades of fuzzy sets theory in operations management: application of life-cycle, bibliometrics and content analysis. Scientometrics, v. 119, n. 3, 1289-1309, 2019.

Published

2022-07-11

How to Cite

RESTREPO-ARANGO, Cristina; CÁRDENAS-ROZO, Andrés L. Textual analysis of scientific articles published on Colombian fossils. Encontros Bibli: revista eletrônica de biblioteconomia e ciência da informação, [S. l.], v. 27, n. 1, p. 1–25, 2022. DOI: 10.5007/1518-2924.2022.e83470. Disponível em: https://periodicos.ufsc.br/index.php/eb/article/view/83470. Acesso em: 21 may. 2024.

Similar Articles

<< < 1 2 3 4 > >> 

You may also start an advanced similarity search for this article.