GRAAL

Graph-based Reasoning Agents for Automatic Labelling

GRAAL is a research-oriented toolkit that combines graph vectorial database, knowledge reasoning, and modular agent components to support agentic automated data labelling in hierarchical classification, a common use case for National Statistical Institutes (NSIs).

The hierarchy of the nomenclature is represented as a graph database (Neo4j), enabling efficient traversal and reasoning over classification codes and their relationships.

It is intended to be used with reasoning LLM agents (that support tool-calling).

Key features

Graph (Neo4j) for representing the classification codes and relations
Neo4j-backed tools for agentic workflows
Utilities for building, embedding and managing graph data
Modular classifiers and navigators to support multi-stage reasoning pipelines

Quick start

Clone the repository:

git clone https://github.com/InseeFrLab/GRAAL.git cd GRAAL
Install uv (recommended) - via pip install uv for instance. Create a virtual environment and install dependencies :

uv sync
Run a small experiment:

uv run -m src.test

Repository layout

Important folders and files:

src/ — main Python package
- agents/ — agent implementations and subcomponents (Code2Text, Text2Code, closers)
- neo4j_graph/ — graph building and helpers for Neo4j-backed graphs
- navigator/ — navigator logic: travel from the root to leaves in the graph, explaining each step
- utils/ — utility modules (logging, parser)
presentation/ — presentation materials and templates
pyproject.toml — project metadata and dependencies

Architecture overview

At a high level, GRAAL composes three concerns:

Knowledge Graph: A graph database (Neo4j or in-memory representation) stores facts, entities and provenance. The neo4j_graph package provides builders and helpers to construct and query this graph.
Agents: Reusable agent building blocks implement specific capabilities. For instance, Code2Text takes a label (code) as input and generates synthetic texts; Text2Code assists in classifying textual specifications in the given classification.
Connectors & Utilities: Embedding helpers, DB managers and parsers make it easy to populate the graph and wire agents into pipelines.

This modular separation allows mixing and matching pieces for experiments or production prototypes.

Contributing

Contributions are welcome. A few guidelines:

Open an issue to discuss significant changes before implementing them.
Follow the existing code style and add tests for new behavior.
Keep changes focused and create feature branches for PRs.

If you add or modify graph schema, please include migration steps or a small script to populate example data.

Roadmap & ideas

Add integration tests for agent pipelines and graph persistence
Implement more example notebooks demonstrating common workflows

License

This project includes a LICENSE file in the repository root. Please refer to it for licensing details.

Authors & contact

Maintained by InseeFrLab. For questions or contributions, please open an issue or contact the maintainers via the project repository.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
.github/workflows		.github/workflows
presentation		presentation
src		src
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
setup.sh		setup.sh
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GRAAL

Graph-based Reasoning Agents for Automatic Labelling

Key features

Quick start

Repository layout

Architecture overview

Contributing

Roadmap & ideas

License

Authors & contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

InseeFrLab/GRAAL

Folders and files

Latest commit

History

Repository files navigation

GRAAL

Graph-based Reasoning Agents for Automatic Labelling

Key features

Quick start

Repository layout

Architecture overview

Contributing

Roadmap & ideas

License

Authors & contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages