Skip to content

Lab 001: What are AI Agents?ΒΆ

Level: L50 Path: All paths Time: ~15 min πŸ’° Cost: Free β€” No account needed

What You'll LearnΒΆ

  • What an AI agent is (and isn't)
  • The four core properties of an agent: perception, memory, reasoning, action
  • How agents differ from simple chatbots and from traditional software
  • Real-world examples of AI agents

IntroductionΒΆ

The word "agent" is everywhere in AI right now β€” but what does it actually mean?

An AI agent is software that uses a Large Language Model (LLM) as its reasoning engine to autonomously pursue a goal, deciding what to do and which tools to call at each step β€” without you pre-programming every possible path.

The key word is autonomous: the agent doesn't just answer a question. It plans, acts, observes the result, and adjusts.


The Four Properties of an AgentΒΆ

1. πŸ” PerceptionΒΆ

The agent receives input β€” a user message, a file, an API response, an event.

2. 🧠 Memory¢

The agent stores information across turns: - Short-term (context window): the current conversation - Long-term (vector store / DB): facts, history, retrieved documents

3. πŸ’‘ ReasoningΒΆ

The LLM decides what to do next: answer directly, call a tool, ask a clarifying question, or break the goal into sub-steps.

4. ⚑ Action¢

The agent does something: calls an API, queries a database, writes a file, sends an email, triggers another agent.

AI Agent Loop β€” Perceive, Reason, Act, Observe

πŸ€” Check Your Understanding

In the agent loop, what happens after the agent acts (e.g., calls an API)?

Answer

The agent observes the result β€” the tool output is fed back into context so the LLM can reason over the new information and decide the next step. This closes the loop: perceive β†’ reason β†’ act β†’ observe β†’ reason again.


Agent vs. Chatbot vs. Traditional SoftwareΒΆ

Traditional Software Chatbot AI Agent
Logic defined by Developer Developer LLM (at runtime)
Handles new situations ❌ Only what's coded ⚠️ Within trained patterns βœ… Adapts dynamically
Uses tools βœ… Hardcoded ⚠️ Limited βœ… Discovers and calls tools
Multi-step reasoning ❌ ❌ βœ…
Predictability βœ… Very predictable βœ… Mostly predictable ⚠️ Less predictable

When NOT to use an agent

Agents are powerful but complex. Use a simple LLM call for single-turn Q&A. Use an agent only when the task requires multi-step reasoning, tool use, or dynamic decision-making.

πŸ€” Check Your Understanding

A traditional chatbot follows a pre-programmed decision tree. How does an AI agent differ when it encounters a situation the developer didn't anticipate?

Answer

An AI agent uses the LLM to adapt dynamically at runtime β€” it reasons about the new situation and decides what to do, even if that exact scenario was never coded. A traditional chatbot can only handle what was explicitly programmed.

πŸ€” Check Your Understanding

When should you use a simple LLM call instead of building a full AI agent?

Answer

Use a simple LLM call for single-turn Q&A tasks that don't require multi-step reasoning, tool use, or dynamic decision-making. Agents add complexity and should only be used when the task truly needs autonomy.


Real-World ExamplesΒΆ

Agent What it does
GitHub Copilot Reads your code, suggests completions, chats, runs commands
Zava Sales Agent (this repo's workshop) Queries PostgreSQL, generates charts, interprets sales trends
Microsoft 365 Copilot Reads emails, calendar, files, drafts replies, summarizes meetings
AutoGen research agent Spawns sub-agents to search, synthesize, and write a report
πŸ€” Check Your Understanding

Which of the four core agent properties (perception, memory, reasoning, action) is primarily responsible for the agent deciding what to do next?

Answer

Reasoning. The LLM uses reasoning to decide the next step β€” whether to answer directly, call a tool, ask a clarifying question, or break the goal into sub-steps. Perception handles input, memory stores context, and action executes the decision.


Key TermsΒΆ

Term Definition
LLM Large Language Model β€” the AI brain (e.g., GPT-4o, Phi-4)
Tool / Function A function the LLM can call (e.g., search_database, send_email)
Context window The "working memory" of the LLM β€” everything it can see at once
Prompt The instructions + context sent to the LLM
Token The unit LLMs process β€” roughly ΒΎ of a word
Grounding Connecting agent responses to real, verified data

🧠 Knowledge Check¢

Q1 (Multiple Choice): Which of the following best describes an AI agent?
  • A) A chatbot that follows a pre-programmed decision tree
  • B) A machine learning model fine-tuned on your company's data
  • C) Software that uses an LLM to autonomously pursue a goal, deciding what to do and which tools to call at each step
  • D) A rule-based keyword matching system that routes users to FAQs
βœ… Reveal Answer

Correct: C

An AI agent uses an LLM as its reasoning engine to autonomously decide what to do at each step β€” including which tools to call, when to loop, and when to stop. Option A describes a traditional chatbot. Option B is fine-tuning (changes model behavior, not agent structure). Option D is a classic NLP routing system.

Q2 (Multiple Choice): In the perceive β†’ reason β†’ act β†’ observe loop, what is the purpose of the 'observe' step?
  • A) The agent reformulates the original user query before reasoning
  • B) The agent receives the result of an action and adds it back to context for the next reasoning step
  • C) The agent calls the LLM to generate a final answer
  • D) The agent saves the conversation to long-term memory
βœ… Reveal Answer

Correct: B

After the agent acts (calls a tool, runs code, queries a database), it observes the result β€” the tool output is added back to the message history. This closes the loop: the LLM now has new information to reason over in the next step. The loop continues until the agent decides it has enough to answer.

Q3 (Multiple Choice): Which of the following is NOT one of the four core properties of an AI agent?
  • A) Perception
  • B) Compilation
  • C) Memory
  • D) Action
βœ… Reveal Answer

Correct: B β€” Compilation is not a core agent property

The four core properties are Perception (receives inputs), Memory (retains context), Reasoning (uses LLM to decide next step), and Action (calls tools/APIs/code). Compilation is a programming language concept, not part of the agent architecture.


SummaryΒΆ

An AI agent is an LLM-powered system that perceives, remembers, reasons, and acts to achieve a goal.It differs from traditional software because the logic is not hardcoded β€” the LLM decides at runtime. This flexibility is powerful, but requires careful design of instructions and tools.


Next StepsΒΆ

β†’ Lab 002: AI Agent Landscape 2025 β€” Compare all the tools and platforms available today.