Knowledge graphs serve as a powerful bridge, translating complex information across various domains and languages. Unlike traditional mathematics or programming, they offer a uniquely accessible way for both humans and AI to comprehend and interact with data.
Human Language
Knowledge graphs organize information in a way that aligns intuitively with human thought and communication patterns, making complex data digestible.
Mathematics
They embody mathematical principles yet remain easily translatable into human-understandable narratives, overcoming traditional communication barriers.
Programming
Bridging the gap to code, knowledge graphs enable seamless integration and interpretation of data for software applications and development processes.
AI & LLMs
Modern AI and Large Language Models natively process and generate information in knowledge graph structures, enhancing their interpretability and reasoning.
Knowledge Graphs for Cross-Domain Communication
Rosetta Stone
A 2nd-century BCE granodiorite stele inscribed with the same decree in three scripts—Egyptian hieroglyphs, Demotic, and Ancient Greek—whose discovery in 1799 unlocked the secrets of hieroglyphic writing.
Knowledge Graphs
A 21st-century digital illustration depicting a central knowledge graph hub interconnected with four domains—Human Language, Mathematics, Programming, and AI Systems—whose conceptual clarity positions it as a modern Rosetta Stone, enabling translation and communication across natural, formal, computational, and artificial intelligence languages.
Knowledge Graph Variety
Humans have implicitly and explicitly constructed knowledge graphs for millennia to supplement language, capture complex concepts, and organize the world. These foundational structures illustrate the incredible flexibility and timeless relevance of interconnected data.
Hierarchical Systems
From ancient taxonomies classifying plants and animals to organizational charts, early knowledge graphs mapped "is-a" relationships, providing clarity and structure to complex domains.
Spatial Representations
The earliest maps and navigational charts visually represented locations and connections, forming spatial knowledge graphs crucial for trade, exploration, and understanding geography.
Genealogical Trees
Family trees, lineage records, and tribal relationships form intricate knowledge graphs tracing ancestry and kinship, preserving cultural identity and historical links across generations.
Knowledge Graph Extraction
Let's start by constructing a simple graph.
Extract a knowledge graph from these sentences: France is in Europe. Europe is on Earth. Paris is in France. The Eiffel tower is a landmark in Paris.
Here's a concise representation of the extracted knowledge graph.
Nodes (Entities):
France
Europe
Earth
Paris
The Eiffel Tower
From Graph to Narrative: Location Hierarchy
Understanding spatial relationships is key. This knowledge graph illustrates a "located-in" hierarchy for the Eiffel Tower, mapping its location from Paris within France, then Europe, and finally, on Earth. Each connection clearly defines the relationship, like "is in" or "is on," with node sizes and colors visually representing scale from local to global.
Prompt to LLM: Can you write a little story that includes all the information stored in the graph?
LLM Output: On the pale blue planet Earth, a great landmass known as Europe stretches across the northern hemisphere. Within this continent lies the storied nation state of France, famed for its art, cuisine, and history. At the very heart of France sits Paris, a vibrant city of boulevards and cafés. And rising above that cityscape is the Eiffel Tower, the iron landmark that has come to symbolize the spirit of Paris itself.
The LLM enriches the knowledge graph by adding information learned during pretraining. By extracting and validating these new entities and relationships, we enhance our knowledge graph, empowering AI with deeper insights and stronger reasoning abilities.
Yet, knowledge graphs aren’t exhaustive. For instance, the concept "nation state," accurately describing France in the text, was omitted. Whether we choose to include terms like "nation state" depends entirely on the intended purpose of the graph. Just as humans selectively use language, we deliberately exclude certain details from our knowledge graphs to keep them relevant, focused, and effective.
Knowledge graphs are not limited to a single domain or data type. Their versatility allows them to represent a vast array of information, from scientific discoveries to social networks, and even historical records.
Textual Data
Information extracted from documents, articles, and literature forms the foundation for graphs that map concepts, entities, and relationships in human language.
Structured Data
Databases, spreadsheets, and tabular information can be transformed into graphs, revealing hidden connections and patterns that are not immediately apparent.
Historical Records
Ancient texts, maps, and artifacts can be digitized and analyzed to construct historical knowledge graphs, preserving and connecting insights from the past.
Geospatial Data
Location-based information, including maps and geographical coordinates, can create powerful graphs that illuminate spatial relationships and navigation paths.
The ability to create new types of knowledge graphs from diverse data sources constantly expands their utility, offering powerful ways to organize, analyze, and tell stories about complex information.
The following examples show the diverse type of knowledge graphs.
Ancient Navigational Knowledge Graphs
Marshall Islands stick charts are remarkable examples of early knowledge representation. These intricate wooden grids, known as "Rebbilib," "Meddo," or "Mattang," were not maps in the traditional sense, but sophisticated navigational tools. They encoded complex information about ocean swells, wave patterns, and island locations, crucial for long-distance canoe voyages across the Pacific.
Each stick and shell on the chart represented specific relationships and entities – islands as shells, and sticks depicting wave patterns or currents. This rich "graph" of oceanic knowledge was often combined with oral histories and songs, transforming abstract data into vivid narratives that guided mariners across vast stretches of water. They illustrate how complex spatial knowledge can be encoded and transmitted.
Transit maps are quintessential knowledge graphs, abstracting complex urban landscapes into clear, navigable networks. They represent stations as nodes and routes as edges, simplifying geographical accuracy to emphasize connectivity and flow. This schematic approach allows millions to effortlessly understand and navigate intricate public transportation systems daily, from subways to buses, highlighting the practical power of simplified data relationships. https://en.wikipedia.org/wiki/Transit_map
The Tabula Peutingeriana: A Roman Road Map
The Tabula Peutingeriana is an extraordinary surviving copy of an ancient Roman road map, detailing the vast network of roads, cities, and significant landmarks of the Roman Empire. It works very much like a modern transit map. Unlike modern maps, it was a long parchment scroll, distorted to emphasize routes and distances rather than geographical accuracy. It served as a vital knowledge graph, connecting key locations and facilitating travel and logistics across the empire.
This remarkable artifact encoded complex spatial and logistical information. Cities, settlements, rivers, and mountains are represented as nodes, while the roads connecting them serve as edges, illustrating how the Romans visualized their interconnected world for efficient communication and movement of goods and armies.
Decision Trees are an illustration of how knowledge graphs can contain reasoning information. Each node represents an entity or a decision point, such as "Applicant," "Credit Score," or "Loan Approved." The branches, or edges, signify the relationships and conditions that lead from one state to the next, like "has score > 700" or "income > $50K." This inherent structure makes them a natural fit for knowledge graph representation, providing a clear, interpretable model for automated reasoning.
Consider this simple loan approval tree as an example of how decision logic maps directly to interconnected nodes and edges within a knowledge graph.
Automated decision‐tree construction algorithms represents an early form of “old‐school” AI–driven knowledge‐graph generation, where nodes correspond to attribute tests and edges to branch outcomes—structuring data into an explicit graphical model.
In contrast, modern approaches span a much broader spectrum: from rule‐mining algorithms and embedding‐based link prediction to large‐language‐model–powered extractors that can ingest unstructured text and output richly annotated, semantically linked graphs. These advanced techniques build on the same principle—automating the discovery of structure in data—but now leverage deep learning and contextual understanding to generate knowledge graphs far more flexibly and with nuanced relationships than simple threshold splits.
Exploring Modern Knowledge Graphs
As we've seen, knowledge graphs have evolved significantly from ancient navigational tools and early AI models. To delve deeper into contemporary applications, advanced methodologies, and the future of interconnected data, explore the next chapter.