Back to publications

pytopicgram: A library for data extraction and topic modeling from Telegram channels

Gómez-Romero, Juan and Cantón-Correa, Javier and Pérez-Mercado, Rubén and Prados-Abad, Francisco and Molina-Solana, Miguel and Fajardo, Waldo
SoftwareX 30 , 102141 (2025)

Abstract:

Telegram is a popular platform for communication, generating large volumes of messages through its open channels. pytopicgram is a Python library designed to help researchers efficiently collect, organize, and analyze Telegram messages, addressing the increasing demand to understand online discourse. Key functionalities include efficient message retrieval, computation of engagement metrics, and advanced topic modeling. By automating the data extraction and analysis pipeline, pytopicgram simplifies the investigation of how content spreads, how topics evolve, and how audiences interact on Telegram. The library’s modular architecture ensures flexibility and scalability, making it suitable for diverse applications. This paper describes the design, main features, and illustrative examples that demonstrate pytopicgram’s practical effectiveness for studying public conversations.

Links:

DOI: 10.1016/j.softx.2025.102141
PDF:

Bibtex:

@article{GomezRomero2025,
  author = {Gómez-Romero, Juan and Cantón-Correa, Javier and Pérez-Mercado, Rubén and Prados-Abad, Francisco and Molina-Solana, Miguel and Fajardo, Waldo},
  title = {pytopicgram: A library for data extraction and topic modeling from Telegram channels},
  journal = {SoftwareX},
  year = {2025},
  volume = {30},
  articleno = {102141},
  doi = {10.1016/j.softx.2025.102141},
  comment = {},
  timestamp = {44}
}