Home Technologies Microsoft GraphRAG for questions that span your whole archive
Knowledge graph retrieval

Microsoft GraphRAG for questions that span your whole archive

What it is & where it fits

How QuantalAI uses Microsoft GraphRAG for questions that span your whole archive.

Answers that join facts across hundreds of documents, plus reliable summaries of a whole archive, where plain search returns scattered snippets and misses the link. That is what GraphRAG delivers when the build is done right. It gets there with a knowledge graph drawn from your own material. We extract the people, matters, products and events in your documents and the relationships between them, then retrieve over that structure instead of text similarity alone. A wrong graph is worse than no graph, so we check extraction against your real records and benchmark it against plain RAG. You pay for graph retrieval only where it changes the answer.

Book a discovery call

Where teams get stuck with their own documents

You have an archive that holds the answer, and your team still cannot get it out quickly. The contract, the matter, the incident history is all in there, spread across hundreds of files. When the question is broad, like how a supplier relates to three other parties across years of correspondence, ordinary search hands back a handful of snippets that each look relevant and none of which connect.

Plain RAG works by matching text that resembles your question, which is fine when the answer sits in one place. It breaks down on questions that take hours of cross-referencing, because the link you need is never spelled out in any single passage. The work then falls back to a senior staff member holding the connections in their head, which is slow and leaves when they do.

Why buying GraphRAG off the shelf under-delivers

Microsoft GraphRAG is open source and free to download, which is why teams reach for it too early. Cloning the repository is not the hard part. Making it answer your questions reliably is.

The first trap is extraction quality. GraphRAG builds its graph by asking a language model to pull entities and relationships out of every document. On generic settings it will invent a relationship that reads well and is wrong, or split one client across three slightly different names. A graph built on that retrieves confident, incorrect connections, which is worse than a plain search that simply misses, because the answer looks authoritative. The fix is unglamorous, tuning the extraction to your domain language and sampling it against your records until it holds up.

The second trap is treating the graph as a one-off. Your documents change, and a graph that was right in March is stale by June, so it needs a refresh job and a cost model. The third trap is running it everywhere, where on simple lookups GraphRAG adds build time and compute for an answer plain RAG gives for less.

How we deliver it

We connect GraphRAG to your own documents and decisions, because retrieval is only as good as the material behind it. That principle of AI-accessible internal data is the whole point. The graph is drawn from your records, so answers are about your business, not the public web.

  1. Right-size it first. Before any build we study the questions you need answered. Relationship-heavy or collection-wide questions make GraphRAG a candidate. Simple lookups go to plain RAG.
  2. Build the graph on a focused slice. We extract entities and relationships from a contained set of your content, tune the extraction to your terms, and sample it against source documents before going wider.
  3. Stand up local and global search. Local search handles specific lookups, global search over community summaries handles broad questions, and plain retrieval is the fall-back where the graph adds nothing.
  4. Benchmark against plain RAG. We score GraphRAG against an ordinary RAG baseline on your real questions, so the extra cost is justified by measured gain.
  5. Set the refresh and guard rails. Answers cite their source nodes, a person reviews high-stakes output, and the graph rebuilds on a schedule you can afford.

A knowledge graph of legal matters and parties with community clusters highlighted, beside a query routing to the right retrieval path

How we run the system matters as much as how we build it. We keep the extraction prompts, community settings and retrieval choices under version control, with an evaluation harness that re-scores the graph whenever they change. That is version-controlled prompts and evaluations in GraphRAG terms, and it answers the only question that counts. Did this change make the answers better or worse. When a tweak drops the score, we revert it.

We also build it to run, not to demo. The graph lives in a proper store such as Neo4j that you can query and inspect, hosted in an Australian region, with the refresh, monitoring and access controls of a quality internal platform. A graph that breaks the first time the corpus grows is unfinished work.

When to choose GraphRAG, and when not to

Choose it when your questions are about connections. Investigations, due diligence, complex litigation, fraud analysis and large technical or regulatory libraries are where graph retrieval pays off, because the value is in tracing relationships a person would otherwise chase by hand. Choose it too when you need to make sense of a whole collection, where community summaries do work snippet retrieval cannot.

Do not choose it for everyday lookup. If your team mostly needs one fact from one document, plain RAG is cheaper to build, simpler to maintain and just as accurate. Be wary of the refresh cost too, since a graph over fast-changing content rebuilds often. In practice we often land on a blend, with graph retrieval reserved for the questions that need it, and we tell you which beforehand.

Where GraphRAG fits in our work

GraphRAG is one retrieval technique we build into larger systems. See how it supports AI agents, artificial intelligence grounded in your data, and data insights and analysis across large archives. It earns its keep most often in Professional Services, FinTech and Banking, Insurance and Government.

Capabilities

What we build with GraphRAG

01

Entity and relationship extraction from your corpus

Pipelines that read your contracts, case files or technical libraries and build the graph of who and what relates to whom. We tune the extraction to your domain terms so the entities are yours, not a generic guess, and sample the output against source documents first.

02

Community summaries across the full collection

GraphRAG groups related entities into communities and writes a summary for each. That is how you ask a question of the whole archive at once and get a themed answer, instead of the few passages that matched the wording of your query.

03

Local and global query routing

Specific lookups go to local search over nearby entities. Broad, sense-making questions go to global search over the community summaries. We set the routing so each question takes the path that answers it, falling back to plain retrieval when the graph adds nothing.

04

Graph store and refresh on Neo4j or a managed equivalent

The graph lives in a store you can inspect, query directly and host in an Australian region. We build the refresh so new documents update the graph on a schedule, and cost that rebuild honestly, since it is the main thing that separates GraphRAG economics from plain RAG.

05

Side-by-side evaluation against a plain RAG baseline

Before GraphRAG goes near production we score it against ordinary RAG on your own questions. If the graph does not beat the baseline on the questions that matter, we say so and recommend the cheaper option.

About Microsoft GraphRAG for questions that span your whole archive

Microsoft GraphRAG for questions that span your whole archive is a ai technique that QuantalAI builds and integrates for Australian organisations. Learn more at the official source: https://github.com/microsoft/graphrag.

No stupid questions

Frequently asked.

Is GraphRAG dead?
No, but the hype around it has cooled, which is healthy. GraphRAG was never meant to replace ordinary RAG for every job. It earns its place on relationship-heavy and whole-collection questions, and it is still developed and used there. What has faded is the idea that every retrieval system needs a graph.
Is GraphRAG open source?
Yes. Microsoft publishes GraphRAG under an open licence on GitHub, and there are open toolkits and graph stores such as Neo4j that pair with it. Open source means you can read the code, host it yourself and avoid lock-in. It does not mean the system runs itself. The engineering that makes it dependable on your documents is still yours to do.
Is GraphRAG free?
The software is free to use. The running of it is not. Building the graph calls a language model over every document, the graph store needs hosting, and the graph has to be rebuilt as your content changes. Those are recurring costs in AUD. We size them in discovery so you can weigh the spend against the value first.
Is GraphRAG better than RAG?
Only for the right questions. GraphRAG wins when answers depend on connections across many documents, or when you need to summarise themes across a whole archive. For straightforward lookups, plain RAG is cheaper and just as accurate. Better depends on what you are asking, which is why we benchmark both on your real questions.
Is GraphRAG good?
It is very good at a narrow set of jobs and poor value everywhere else. On investigations, due diligence, complex case work and large regulatory libraries it answers questions plain search cannot. On simple lookup it adds cost for no gain. A graph built on sloppy extraction is worse than plain search, so the build quality decides whether it is good in practice.
Is GraphRAG useful?
Useful when your questions are about relationships or whole-collection patterns, and when the documents hold connections a person would otherwise trace by hand for hours. Less useful when your team mostly needs one fact from one document. We work out which case you are in during discovery, and tell you plainly if the cheaper answer is the right one.
Is GraphRAG still relevant?
Yes, for the work it suits. The novelty has worn off, but the underlying need, answering questions that span a body of connected information, has not gone anywhere. Relevance for you is about your documents and your queries, not the trend cycle.
Is GraphRAG a knowledge graph?
GraphRAG builds and uses a knowledge graph, but it is more than the graph alone. It is the full method of extracting entities and relationships, grouping them into communities, summarising those communities, and routing queries to the right retrieval path. The graph is the foundation, and the retrieval and summarisation on top turn it into answers.
Take the next step

Find out if graph retrieval actually fits your archive

Send us the questions your team traces by hand across dozens of documents. We will tell you whether GraphRAG earns its build cost on your material, or whether plain RAG answers them for less.

Book a discovery call