Feeding Your AI the Right Data: Meet RAG (Part 2 of 3)

Keith A. McFarland

Published Jul 23, 2025

Key Takeaways:

Why AI needs your data: Out-of-the-box AI models don’t know your latest project details. They’re trained on public info that can be outdated or too general. Without your data, an AI’s advice on your project is mostly educated guesswork.
RAG to the rescue: Retrieval-Augmented Generation (RAG) is a technique to supply AIs with relevant information on the fly. Think of it as handing your AI a briefing folder before it answers a question, so it has the facts straight .
Benefits of RAG: By grounding AI responses in your documents, RAG can reduce hallucinations (those confident wrong answers) and keep answers up-to-date and project-specific . It’s like giving the AI a memory upgrade filled with your company’s knowledge.
How RAG works (in simple terms): RAG pulls info from a predefined data source (your files, wikis, emails) related to a query, and the AI uses that info to craft its answer. No retraining needed, it’s a clever shortcut to make AI smarter about your world.
Empowering your team: With RAG, your whole team (not just the CTO) can leverage AI that “knows” your project context, leading to faster decisions and less back-and-forth searching for info.

🧐 The Case of the Clueless AI (Why Public Data Isn’t Enough)

Traditional large language models (the brains behind AI chatbots) learn from a fixed dataset (e.g., everything published on the internet up to a certain cut-off date). They don’t automatically know what happened after that cut-off, nor do they inherently understand your specific company’s data. Expecting them to know your project specifics is like expecting a new hire to intuitively know all your company’s in-jokes and protocols on day one. Not gonna happen.

Moreover, these models can be over-confident. If they don’t know something, they might fill in the blanks with a best guess. (Ever had that one coworker who, when they don’t know the answer, just makes something up very confidently? Yes, like that.) This tendency is what we lovingly call hallucination in AI.

So how do we fix this? How do we get our AI from clueless to clued-in?

🔧 RAG: Giving Your AI the Inside-scoop (a.k.a. Cheat Sheet)

Enter Retrieval-Augmented Generation, thankfully nicknamed RAG. RAG is a fancy term for a straightforward idea: before an AI answers your question, it first retrieves relevant information from a source you provide, and then uses that info to generate its answer. This is how I like to take tests, open-book or in this case your cheat sheet.

Think of RAG as hiring a smart research assistant for your AI. Here’s how it works in plain English:

You ask a question. (Example: “Hey AI, what’s the status of Project X’s budget?”)
The AI searches your data. Using a retrieval system, the AI combs through a defined set of documents/data (maybe your project SharePoint, Confluence pages, or a contract folder) to find the bits and pieces about “Project X budget”.
It finds relevant info. Maybe it pulls up a Q2 budget report and last week’s project status update where budget was mentioned.
It generates an answer using those snippets. Now the AI isn’t just spitting out generic budgeting advice, it’s crafting a response citing the actual numbers and context from your files. Essentially, the AI says: “Based on the project reports, we’ve spent 60% of the budget in Q2, and we’re on track, but there’s a note about unexpected vendor costs.”

The magic here is that the AI’s answer is grounded in your real data. Boom, instantly relevant. It’s no longer just guessing. It’s referring to the cheat sheet you allowed it to peek at. And the best part? You didn’t have to manually program those facts into the AI beforehand or wait for a new model training cycle. RAG does this on the fly, which is way more practical.

💡 Why RAG is a Game-Changer

By now, you might be thinking, “Okay, it fetches data and then answers. Nice trick, but is it really a big deal?” Oh, it is. Here’s why RAG has everyone from CIOs to interns excited:

No more outdated answers: Remember that AI giving me two-year-old data? With RAG, that’s far less likely. If you’ve got up-to-date project documents, the AI will use those instead of its ancient training data. It’s how an AI can know what happened yesterday without having been explicitly trained on yesterday’s events .
Fewer hallucinations: When an AI has solid source material, it doesn’t need to fill gaps with made-up facts. It’s like giving it a map so it won’t wander off into fantasy land. In fact, RAG drastically reduces the “confidence without accuracy” problem. The AI will pull from authoritative sources you trust, rather than the recesses of its imagination.
Project-specific expertise: A vanilla AI is a generalist, it knows a little about a lot. RAG turns it into a specialist for your data. If your project is about, say, building a sustainable supply chain, the AI can absorb your research papers and internal memos on that topic and give answers tailored to that domain, instead of spouting generic sustainability tips.
No re-training needed: One way to get AI to know your info is to train/fine-tune a model on your data, which is expensive, slow, and technically complex. RAG sidesteps that. You keep your big brain AI as is and just feed it tidbits when needed.
Transparency and trust: Many RAG implementations even cite sources for their answers. This means when the AI says “Project X is at 60% budget utilization,” it can show the document it came from. That’s huge for trust because your team can verify the info instead of blindly trusting the AI. It’s the “according to this report…” approach.

Recommended by LinkedIn

How to Get Ready for AI: What Most Organizations Are…

Eric Kimberling 7 months ago

AI is the future. But do we have the right data?

Alok Agrawal 1 year ago

Why AI Isn't Autonomous (Yet)

Jessica Talisman 5 months ago

🚀 READ THIS: Making RAG Happen - How to Empower Your Team’s AI

At this point, you might be eager to try this out (or at least, I hope you are!). So how can you and your team implement RAG for your projects?

1. Identify your data sources. What information would your AI need to help with project decisions? Likely sources include project charters, status reports, requirement documents, team emails/notes, and knowledge base articles. Basically, whatever you would read through to get context, the AI should have access to as well (minus any super confidential stuff you’d never put in an AI, of course).

2. Choose a RAG-friendly tool or platform. The good news: you don’t have to code this from scratch. There are emerging tools that handle the retrieval and even the Q&A interface. (In Part 3, we’ll go into specific services – spoiler: Google’s NotebookLM is one example that lets you upload documents and chat with an AI about them .) If you have a development team handy, there are also frameworks like LangChain or LlamaIndex that make it easier to build a custom solution with document retrieval. Or, some enterprise AI platforms have RAG capabilities built-in, where you can connect a SharePoint or Confluence repository directly.

3. Index or embed your data. Under the hood, RAG often uses something called embeddings (a way to convert text into vectors – think numeric representations – for easy matching). You don’t need to deeply understand the math, but essentially the tool will create an index of your documents so it can quickly find relevant bits later. This step might involve using a vector database or search index. Many tools handle this automatically when you upload your files.

4. Set boundaries and test. It’s wise to start small: maybe give the AI a subset of project data and ask it some test questions. See if the answers make sense and come with references if possible. You want to ensure it’s pulling the right info. Sometimes you might need to fine-tune which files or how many results it uses (too little context and it might miss something; too much and it might get overwhelmed or hit context limits).

5. Iterate and expand. As your projects evolve, keep feeding the AI updated information. RAG is not a one-and-done; it’s an ongoing process of keeping the AI’s “knowledge stash” current. Build it into your project workflow: new report published? Add it to the AI’s data source. Completed a project? Maybe remove those docs if they’re no longer relevant (or keep them for historical Q&A – that can be useful too).

By empowering an AI with RAG, you’re effectively giving every team member a smart assistant that knows your projects. It’s like giving everyone an analyst who’s read all the project documentation and is available 24/7 to answer questions. Imagine the onboarding of new team members – instead of reading a 50-page project brief, they could literally ask the AI, “Hey, what’s the main objective of Project X and what’s been done so far?” and get a tailored summary. Time saver, right?

🎯 Wrapping Up Part 2

We covered a lot of ground here but mainly focus on the value of relevance for an AI achievable through RAG.

In Part 3, we’ll explore some real tools and services that make RAG accessible (including that intriguing NotebookLM we mentioned, and other differences like RAG vs. simply letting an AI surf the web). We’ll also discuss the difference between an AI using your curated data vs. doing a plain web search, and why that distinction matters for practical use in business.

Call to Action: Have you ever tried using AI with your own project or work data? If so, what was your experience? And if not, what data would you most want to feed an AI to help you in your job? Share your ideas or experiences in the comments.

Sources:

AWS – Definition of RAG – Explains RAG as pairing an LLM with external knowledge to improve accuracy (cited by Google’s NotebookLM write-up).
AWS – “What is RAG? (Retrieval-Augmented Generation)” – Highlights LLM limitations (static data, cut-off dates, hallucinations) and how RAG provides relevant, current info to overcome these issues.
AskHandle Blog – “RAG vs. Search” – Describes RAG in simple terms: first retrieve relevant text, then generate a synthesized answer (as opposed to classic search’s approach).
BundleIQ Blog – “Google Notebook LM” – Notes how Google’s NotebookLM uses RAG with LLMs to let users query their own documents, a practical example of RAG in action.

Hashtags: #AI #Data #Context

2 Comments

Damian Leon

4mo

📊 Thanks Keith A. McFarland! Love these illustrations 🙂 to add value here I just posted a clean, 60-second video that simplifies #AIData and #RAG—perfect for execs getting started. 🎯 https://www.linkedin.com/feed/update/urn:li:activity:7354810391165603840/

Sachin Padhiyar

4mo

Excited to see how RAG can enhance project insights!

See more comments

To view or add a comment, sign in

LinkedIn respects your privacy

Feeding Your AI the Right Data: Meet RAG (Part 2 of 3)

Keith A. McFarland

🧐 The Case of the Clueless AI (Why Public Data Isn’t Enough)

🔧 RAG: Giving Your AI the Inside-scoop (a.k.a. Cheat Sheet)

💡 Why RAG is a Game-Changer

Recommended by LinkedIn

🚀 READ THIS: Making RAG Happen - How to Empower Your Team’s AI

🎯 Wrapping Up Part 2

More articles by Keith A. McFarland

Others also viewed

The AI & Data Digest - 5th Edition

Part 2: Stop Limiting the Power of Your AI — Build with Rules Engines for Agentic AI

The non-extractive version of AI.

This Week in AI #1: Scaling Q&A, Efficiency Trade-offs, and Increasing Credibility

Understanding the Role of AI in Government Decision Making

Why Determinism Matters in the Age of Agentic AI

To Finetune Or Not - PART 2: Why Bigger Datasets Don’t Always Mean Better AI

Good data means Good AI

Data is not information, information is not knowledge...

The 7 personas of Machine Learning – and what they need from you as a leader

Explore content categories

🧐 The Case of the Clueless AI (Why Public Data Isn’t Enough)

🔧 RAG: Giving Your AI the Inside-scoop (a.k.a. Cheat Sheet)

💡 Why RAG is a Game-Changer

Recommended by LinkedIn

🚀 READ THIS: Making RAG Happen - How to Empower Your Team’s AI

🎯 Wrapping Up Part 2

More articles by Keith A. McFarland

Beyond the Office: Why AI and Robots…

Big AI Bets… and the Efficiency…

Accelerating with AI: When Speed…

When Walmart Met ChatGPT: The Dawn of…

Agentic Coding and the Future of…

What DORA 2025 Really Tells Us About AI…

Agentic AI with GitHub Copilot? Don’t…

AI Crowd, Real Insights: When Synthetic…

Defeating Nondeterminism in LLM…

Yes, at eSimplicity, We're Still Hiring!

Others also viewed

The AI & Data Digest - 5th Edition

Part 2: Stop Limiting the Power of Your AI — Build with Rules Engines for Agentic AI

The non-extractive version of AI.

This Week in AI #1: Scaling Q&A, Efficiency Trade-offs, and Increasing Credibility

Understanding the Role of AI in Government Decision Making

Why Determinism Matters in the Age of Agentic AI

To Finetune Or Not - PART 2: Why Bigger Datasets Don’t Always Mean Better AI

Good data means Good AI

Data is not information, information is not knowledge...

The 7 personas of Machine Learning – and what they need from you as a leader

Explore content categories