Back to publications

Visualizing large knowledge graphs: A performance analysis

Gómez-Romero, Juan and Molina-Solana, Miguel and Oehmichen, Axel and Guo, Yike
Future Generation Computer Systems 89 , pp. 224–238 (2018)

Abstract:

Knowledge graphs are an increasingly important source of data and context information in Data Science. A first step in data analysis is data exploration, in which visualization plays a key role. Currently, Semantic Web technologies are prevalent for modeling and querying knowledge graphs; however, most visualization approaches in this area tend to be overly simplified and targeted to small-sized representations. In this work, we describe and evaluate the performance of a Big Data architecture applied to large-scale knowledge graph visualization. To do so, we have implemented a graph processing pipeline in the Apache Spark framework and carried out several experiments with real-world and synthetic graphs. We show that distributed implementations of the graph building, metric calculation and layout stages can efficiently manage very large graphs, even without applying partitioning or incremental processing strategies.

Links:

DOI: 10.1016/j.future.2018.06.015
PDF: https://www.doc.ic.ac.uk/ mmolinas/publications/gomez-fgcs18.pdf

Bibtex:

@article{Gomez2018,
  title = {Visualizing large knowledge graphs: A performance analysis},
  author = {G\'omez-Romero, Juan and Molina-Solana, Miguel and Oehmichen, Axel and Guo, Yike},
  journal = {Future Generation Computer Systems},
  year = {2018},
  volume = {89},
  pages = {224--238},
  doi = {10.1016/j.future.2018.06.015},
  issn = {0167-739X},
  comment = {https://www.doc.ic.ac.uk/~mmolinas/publications/gomez-fgcs18.pdf},
  timestamp = {13}
}