๐ Module Progress
๐ฏ Learning Objectives
๐ Theory
How to Evaluate AI Responses
Getting an answer from AI is easy. Knowing whether to trust it โ that's the real skill. Here's your evaluation framework:
1. Accuracy Check โ "Is this factually correct?"
Cross-check with reliable sources (Google, Wikipedia, official docs). Be especially careful with: dates, statistics, historical events, scientific facts.
AI says: "Albert Einstein invented the telephone."
You check: Actually, Alexander Graham Bell invented it in 1876.
Verdict: Hallucination detected! โ
2. Relevance Check โ "Does this answer MY specific question?"
AI sometimes goes off-topic or answers a similar but different question. Make sure the response actually addresses what you asked.
You ask: "How do I install OpenClaw on Windows?"
AI answers: "OpenClaw is a great tool for..." (general info, no install steps)
Verdict: Not relevant! โ
3. Completeness Check โ "Is the answer thorough enough?"
Does it cover all parts of your question? Are there gaps in the explanation?
You ask: "Compare OpenClaw, Claude, and Hermes."
AI answers: "OpenClaw is..." (only talks about OpenClaw)
Verdict: Incomplete! โ
4. Source Verification โ "Can I trust this source?"
If AI mentions specific data, statistics, or claims: Does it cite sources? Can you verify independently? Is the source reputable?
When to Ask Follow-Up Questions
Don't accept the first answer blindly! Ask:
- "Can you elaborate on point 2?"
- "Why is this better than the alternative?"
- "Can you give a real-world example?"
- "What are the limitations?"
Understanding AI Limitations
๐ด Hallucinations
AI can confidently state false information as if it were true. Always verify important facts, especially medical, legal, or financial advice.
๐ก Knowledge Cutoff
AI's training data has a cutoff date. It might not know about: recent events (after its training), new tools or software updates, breaking news.
๐ข Context Limits
If the conversation gets too long, AI might "forget" earlier parts. Keep conversations focused and concise.
Key Takeaway
"Trust but verify."
AI is a powerful assistant, not an infallible oracle. Your critical thinking is the safety net.
๐งช Hands-On: Test and Verify (10 min)
Goal: Practice verifying AI responses and asking good follow-up questions.
Step 1: The Easy Question
โผ
Ask AI: "What is the capital of Malaysia?"
Expected: "Kuala Lumpur." โ
Easy to verify โ you already know this!
Step 2: The Trickier Question
โผ
Ask AI: "Explain quantum computing in simple terms."
Task:
1. Read the AI's response
2. Google "quantum computing simple explanation"
3. Compare โ does the AI capture the essence? (superposition, qubits, etc.)
Step 3: The Hallucination Test (Simulated)
โผ
Imagine AI says: "The capital of Malaysia is Putrajaya, a planned city that became capital in 2001."
Your action: Quick Google search โ you realize the claim is false (KL is still the official capital; Putrajaya is the administrative capital).
Lesson: Even confident-sounding answers can be completely wrong!
Step 4: Practice Follow-Ups
โผ
Take the quantum computing answer and ask:
1. "Can you give a real-world analogy?"
2. "What are the practical applications today?"
3. "What are the limitations?"
Observe: How the AI builds on its previous answer and expands the explanation.
Step 5: Reflection
โผ
Answer these questions:
1. When would you definitely verify AI's answer? (e.g., medical, legal, financial advice)
2. When might it be okay to trust without verifying? (e.g., casual conversation, brainstorming ideas)
๐งช Quiz: Evaluating AI Responses
Test your understanding with 5 questions. Choose the best answer, then click "Answer" to check.