
How to Build a Local AI Agent with LM Studio plus Gemma 3 270M

Learn to build an AI agent that runs locally using Go and LM Studio

August 17, 2025 · 21 min read

Weeks after building a RAG pipeline and playing around with different agents on DigitalOcean’s Gradient Platform, I’ve been fascinated by AI agents lately. While going through my bookmarks, I came across this resource that breaks down How to Build an Agent. It was such a nice read, and since I’ve been experimenting with a lot of local models, especially Gemma 3 270M, it was a no-brainer to test out Gemma’s tool-calling capabilities. I’ll try to break everything down from first principles so you can easily replicate this.

Understanding the Basics

What is an AI Agent?

While a Large Language Model on its own is limited to conversational interactions, an agent can actually do things and execute real tasks. An AI agent can:

  1. Understand what you want (through natural language)
  2. Decide what actions to take (using its training)
  3. Execute those actions (through tools we give it)
  4. Remember the conversation context (to have coherent interactions)

The secret sauce? Tools. Tools give an AI the ability to read files, search the web, or edit documents, and suddenly you have an agent autonomously handling tasks on your behalf.

How our Agent actually works

Our agent is essentially a conversation loop with superpowers. When you type “create a hello world program,” here’s what happens:

  1. The agent listens to your message and adds it to the conversation history
  2. Sends everything to the AI - your request plus a list of available tools (read_file, list_files, edit_file, create_file)
  3. The AI decides - “I should create a simple Python program that prints hello world”
  4. Requests the create_file tool with parameters: `{"path": "hello.py", "content": "print('Hello, World!')"}`
  5. Our agent executes it and feeds the results back: “File created at hello.py”
  6. The AI responds - “I’ve created a hello world program in Python for you. You can run it with python3 hello.py.”

The best part about our approach is that the AI never actually touches the files on your device. It just makes intelligent decisions about which tools to use, while your local agent code does the actual work.

By the end of this guide, you’ll have built that complete system - one that can have natural conversations about files and actually manipulate them on your machine, all running locally with no cloud dependencies.

Getting Started

I did this in Go, like the original resource. I figured it would be easier to build on the code they had and modify it for my local use case.

Before we dive into the actual implementation, you’ll need a few things set up.

Setting Up LM Studio

LM Studio has become my go-to for local AI work. I’m privileged to have a device that can run a diverse set of AI models up to 30B parameters quite reliably and effortlessly.

  1. Download LM Studio from lmstudio.ai
  2. Install it like any regular application
  3. Load a model - I recommend starting with Gemma 3 270M (about 500MB), or Qwen 3 4B if you want better results (or any model you want to use)

Once installed:

  • Go to the “Discover” tab and search for “gemma-3-270m-it-mlx”. The exact repo of the Gemma model I use on my Mac is this one.
  • Download it (grab a coffee or even take a break - even small models take a few minutes)
  • Click “Load” to activate it
  • Head to the Developer tab and toggle the server on. You’ll see “Status: Running”

You should see the server is “Reachable at: http://127.0.0.1:1234”. Leave this running as it’s your local AI API now.

Quick Verification

Let’s make sure everything’s working:

Terminal window
curl http://127.0.0.1:1234/v1/models

If you see your model listed, you’re ready to build!

You'll see this when you curl

Install Go

Download Go from go.dev and follow the installation instructions. Verify it’s working with `go version`. If you see a version number, you’re set.

check go version

Building Our Agent

Project Structure

I like to start simple and grow complexity as needed. Let’s create a clean structure:

Terminal window
mkdir local-agent
cd local-agent
go mod init github.com/yourusername/local-agent

This creates our project directory and a go.mod file for dependency management. Now we’ll create four files, each with a specific purpose:

| File | What It Does | Why It Matters |
| --- | --- | --- |
| client.go | Talks to LM Studio | Your connection to the AI |
| tools.go | Defines what the agent can do | The agent’s capabilities |
| agent.go | Orchestrates everything | The brain of the operation |
| main.go | Starts the program | Your entry point |

This separation is key for me, and it has saved me countless debugging hours. When something breaks, you know exactly where to look. The original post had everything in one main.go file, but it was wiser to split things up this way on my end.

Building the LM Studio Client

LM Studio speaks the OpenAI API format, which means we can use standard HTTP without any special SDKs. When working locally and experimenting, I think direct HTTP calls teach you more about what’s happening, and when things inevitably break, you’ll know exactly where to look.

Create client.go:

package main

import (
    "bytes"
    "encoding/json"
    "fmt"
    "io"
    "net/http"
)

// LMStudioClient handles all communication with our local AI
type LMStudioClient struct {
    baseURL string // Where LM Studio lives (http://127.0.0.1:1234)
    model   string // Which model to use
}

func NewLMStudioClient(baseURL, model string) *LMStudioClient {
    return &LMStudioClient{
        baseURL: baseURL,
        model:   model,
    }
}

This approach is so clutch because it needs no dependencies. No version conflicts. Just standard HTTP doing its thing.

Understanding Messages

Every conversation with an AI is just a series of messages.

type Message struct {
    Role       string     `json:"role"`                   // Who's talking: "user", "assistant", or "tool"
    Content    string     `json:"content"`                // What they're saying
    ToolCalls  []ToolCall `json:"tool_calls,omitempty"`   // Tools the AI wants to use
    ToolCallID string     `json:"tool_call_id,omitempty"` // Links responses back to requests
}

A typical conversation might look like:

  • User: “What files are in my documents folder?”
  • Assistant: “Let me check…” (requests the list_files tool)
  • Tool: “Found: test.pdf, notes.txt” (tool result)
  • Assistant: “I found 2 items in your documents folder: test.pdf and notes.txt.”
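One thing the snippets in this post lean on but never reprint is the set of supporting types: ToolCall, ToolSchema, ChatRequest, and ChatResponse. Here’s a minimal sketch of how they might look, matching the fields the rest of the code uses and the OpenAI-compatible wire format LM Studio serves; the versions in the repo may differ slightly.

// ToolCall is the AI's request to run one of our tools.
type ToolCall struct {
    ID       string `json:"id"`
    Type     string `json:"type"` // always "function" in practice
    Function struct {
        Name      string `json:"name"`
        Arguments string `json:"arguments"` // JSON-encoded parameters
    } `json:"function"`
}

// ToolSchema describes a tool to the model in the OpenAI "function" format.
type ToolSchema struct {
    Type     string `json:"type"`
    Function struct {
        Name        string                 `json:"name"`
        Description string                 `json:"description"`
        Parameters  map[string]interface{} `json:"parameters"`
    } `json:"function"`
}

// ChatRequest is the body we POST to /v1/chat/completions.
type ChatRequest struct {
    Model       string       `json:"model"`
    Messages    []Message    `json:"messages"`
    Tools       []ToolSchema `json:"tools,omitempty"`
    ToolChoice  string       `json:"tool_choice,omitempty"`
    MaxTokens   int          `json:"max_tokens,omitempty"`
    Temperature float64      `json:"temperature,omitempty"`
}

// ChatResponse is the part of the reply we actually care about.
type ChatResponse struct {
    Choices []struct {
        Message Message `json:"message"`
    } `json:"choices"`
}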

Sending Requests

Here’s where the magic happens—talking to our local AI:

func (c *LMStudioClient) CreateChatCompletion(messages []Message, tools []ToolSchema) (*ChatResponse, error) {
    // Package up our request
    request := ChatRequest{
        Model:       c.model,  // Which AI to use
        Messages:    messages, // The conversation so far
        Tools:       tools,    // Available tools
        MaxTokens:   1000,     // Response length limit
        Temperature: 0.7,      // Creativity level (0=boring, 1=creative)
    }

    // Tell the AI it can use tools when needed
    if len(tools) > 0 {
        request.ToolChoice = "auto"
    }

    // Convert to JSON and send
    jsonData, err := json.Marshal(request)
    if err != nil {
        return nil, fmt.Errorf("failed to create request: %w", err)
    }

    resp, err := http.Post(
        c.baseURL+"/v1/chat/completions",
        "application/json",
        bytes.NewBuffer(jsonData),
    )
    if err != nil {
        return nil, fmt.Errorf("failed to send request: %w", err)
    }
    defer resp.Body.Close()

    // Parse the response
    body, err := io.ReadAll(resp.Body)
    if err != nil {
        return nil, fmt.Errorf("failed to read response: %w", err)
    }
    if resp.StatusCode != http.StatusOK {
        return nil, fmt.Errorf("API error (status %d): %s", resp.StatusCode, string(body))
    }

    var chatResp ChatResponse
    if err := json.Unmarshal(body, &chatResp); err != nil {
        return nil, fmt.Errorf("failed to parse response: %w", err)
    }
    return &chatResp, nil
}

What I love about this approach is its transparency. Every request and response is visible. When debugging, you can see exactly what you’re sending and what comes back.
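If you want to sanity-check the client before wiring up any tools, a tiny smoke test helps. This is a hypothetical snippet, not one of the post’s files - it assumes the client and types above compile as shown, and you should swap in whatever model name /v1/models reports on your machine:

// smokeTest sends a single message with no tools and prints the reply.
func smokeTest() error {
    client := NewLMStudioClient("http://127.0.0.1:1234", "gemma-3-270m-it-mlx") // use your model's name
    resp, err := client.CreateChatCompletion([]Message{
        {Role: "user", Content: "Say hello in one short sentence."},
    }, nil)
    if err != nil {
        return err
    }
    if len(resp.Choices) == 0 {
        return fmt.Errorf("no choices in response")
    }
    fmt.Println(resp.Choices[0].Message.Content)
    return nil
}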

Creating Tools

Now, like I previously mentioned, tools transform an AI from a chatbot into something that can actually do things. So let’s create some tools.

Create tools.go:

package main

import (
    "encoding/json"
    "fmt"
    "os"
    "path/filepath"
    "strings"
)

// Every tool needs these four things
type Tool interface {
    Name() string                                 // What the AI calls it
    Description() string                          // When to use it
    Schema() ToolSchema                           // What parameters it needs
    Execute(args json.RawMessage) (string, error) // What it actually does
}

Building the File Reader

Let’s start with something useful—reading files. This tool can be useful if you need an agent to analyze documents or configuration files.

type ReadFileTool struct{}

func (t ReadFileTool) Name() string {
    return "read_file"
}

func (t ReadFileTool) Description() string {
    return "Read the contents of a file"
}

func (t ReadFileTool) Schema() ToolSchema {
    return ToolSchema{
        Type: "function",
        Function: struct {
            Name        string                 `json:"name"`
            Description string                 `json:"description"`
            Parameters  map[string]interface{} `json:"parameters"`
        }{
            Name:        t.Name(),
            Description: t.Description(),
            Parameters: map[string]interface{}{
                "type": "object",
                "properties": map[string]interface{}{
                    "path": map[string]interface{}{
                        "type":        "string",
                        "description": "The path to the file to read",
                    },
                },
                "required": []string{"path"},
            },
        },
    }
}

func (t ReadFileTool) Execute(args json.RawMessage) (string, error) {
    var params map[string]interface{}
    if err := json.Unmarshal(args, &params); err != nil {
        return "", fmt.Errorf("invalid arguments: %w", err)
    }

    // Handle different parameter names that models might use
    path := ""
    if p, ok := params["path"].(string); ok {
        path = p
    } else if p, ok := params["file"].(string); ok {
        path = p
    } else if p, ok := params["filename"].(string); ok {
        path = p
    }
    if path == "" {
        return "", fmt.Errorf("path parameter required")
    }

    content, err := os.ReadFile(path)
    if err != nil {
        return "", fmt.Errorf("failed to read file: %w", err)
    }
    return string(content), nil
}

Why Flexible Parameter Handling Matters

Here’s something I learned the hard way: different AI models use slightly different parameter names. One model might send `{"path": "file.txt"}` while another sends `{"file": "file.txt"}` or `{"filename": "file.txt"}`.

Early versions of my agents would break constantly because of this. Now I handle common variations upfront, and my agents work reliably across different models.
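If the repeated `if p, ok := ...` chains start to bother you, one way to factor them out is a tiny helper. This is a hypothetical refactor, not part of the original code:

// firstString returns the first non-empty string found under any of the given
// keys - handy because different models name the same parameter differently.
func firstString(params map[string]interface{}, keys ...string) string {
    for _, key := range keys {
        if v, ok := params[key].(string); ok && v != "" {
            return v
        }
    }
    return ""
}

With that in place, the lookup inside Execute collapses to `path := firstString(params, "path", "file", "filename")`.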

Building More Tools

Following the same pattern, I built tools for listing directories, editing files, and creating new files. Each one follows the four-part structure: name, description, schema, and execution function.

The list files tool is particularly useful:

func (t ListFilesTool) Execute(args json.RawMessage) (string, error) {
    var params map[string]interface{}
    if err := json.Unmarshal(args, &params); err != nil {
        return "", fmt.Errorf("invalid arguments: %w", err)
    }

    // Handle parameter variations
    path := ""
    if p, ok := params["path"].(string); ok {
        path = p
    } else if p, ok := params["dir"].(string); ok {
        path = p
    } else if p, ok := params["directory"].(string); ok {
        path = p
    }

    // Default to current directory
    if path == "" || path == "current directory" || path == "." {
        path = "."
    }

    entries, err := os.ReadDir(path)
    if err != nil {
        return "", fmt.Errorf("failed to read directory: %w", err)
    }

    var files []string
    for _, entry := range entries {
        prefix := "📄"
        if entry.IsDir() {
            prefix = "📁"
        }
        files = append(files, fmt.Sprintf("%s %s", prefix, entry.Name()))
    }
    return strings.Join(files, "\n"), nil
}

I added the emoji prefixes because they make the agent’s output much more chat-like, readable, and pretty. Simply leave the prefix blank if you don’t need it for your use case.
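The edit_file and create_file tools follow exactly the same four-part pattern, so I won’t reprint them all here. For reference, a minimal create_file could look like the sketch below - the version in the repo may differ, but the shape is the same:

type CreateFileTool struct{}

func (t CreateFileTool) Name() string        { return "create_file" }
func (t CreateFileTool) Description() string { return "Create a new file with the given content" }

func (t CreateFileTool) Schema() ToolSchema {
    return ToolSchema{
        Type: "function",
        Function: struct {
            Name        string                 `json:"name"`
            Description string                 `json:"description"`
            Parameters  map[string]interface{} `json:"parameters"`
        }{
            Name:        t.Name(),
            Description: t.Description(),
            Parameters: map[string]interface{}{
                "type": "object",
                "properties": map[string]interface{}{
                    "path":    map[string]interface{}{"type": "string", "description": "Where to create the file"},
                    "content": map[string]interface{}{"type": "string", "description": "What to write into it"},
                },
                "required": []string{"path", "content"},
            },
        },
    }
}

func (t CreateFileTool) Execute(args json.RawMessage) (string, error) {
    var params map[string]interface{}
    if err := json.Unmarshal(args, &params); err != nil {
        return "", fmt.Errorf("invalid arguments: %w", err)
    }
    path, _ := params["path"].(string)
    content, _ := params["content"].(string)
    if path == "" {
        return "", fmt.Errorf("path parameter required")
    }

    // Make sure the parent directory exists before writing.
    if dir := filepath.Dir(path); dir != "." {
        if err := os.MkdirAll(dir, 0755); err != nil {
            return "", fmt.Errorf("failed to create directory: %w", err)
        }
    }
    if err := os.WriteFile(path, []byte(content), 0644); err != nil {
        return "", fmt.Errorf("failed to create file: %w", err)
    }
    return fmt.Sprintf("File created at %s", path), nil
}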

Now comes the interesting part: building the agent that coordinates everything. This is where I’ve spent the most time iterating and improving.

Create agent.go:

package main

import (
    "bufio"
    "encoding/json"
    "fmt"
    "os"
    "strings"
)

type Agent struct {
    client   *LMStudioClient // Connection to the AI
    tools    map[string]Tool // Available tools
    messages []Message       // Conversation history
    maxTurns int             // Safety limit
}

func NewAgent(client *LMStudioClient) *Agent {
    tools := make(map[string]Tool)

    // Register all available tools
    toolList := []Tool{
        ReadFileTool{},
        ListFilesTool{},
        EditFileTool{},
        CreateFileTool{},
    }
    for _, tool := range toolList {
        tools[tool.Name()] = tool
    }

    return &Agent{
        client:   client,
        tools:    tools,
        messages: []Message{},
        maxTurns: 10, // Prevent infinite loops
    }
}

The Core Loop

This is where everything comes together. The agent needs to:

  1. Listen to the user
  2. Decide if tools are needed
  3. Execute tools if requested
  4. Respond naturally
func (a *Agent) processWithAI() error {
    turns := 0
    for turns < a.maxTurns {
        turns++

        // Ask the AI what to do
        response, err := a.client.CreateChatCompletion(a.messages, a.getToolSchemas())
        if err != nil {
            return fmt.Errorf("failed to get AI response: %w", err)
        }
        assistantMessage := response.Choices[0].Message

        // Did the AI want to use tools?
        if len(assistantMessage.ToolCalls) > 0 {
            fmt.Println("\n🔧 Using tools...")

            // Remember what the AI said
            a.messages = append(a.messages, assistantMessage)

            // Execute each requested tool
            for _, toolCall := range assistantMessage.ToolCalls {
                result := a.executeToolCall(toolCall)

                // Save the result for the AI to see
                a.messages = append(a.messages, Message{
                    Role:       "tool",
                    Content:    result,
                    ToolCallID: toolCall.ID,
                })

                fmt.Printf("  ✓ %s executed\n", toolCall.Function.Name)
                if result != "" && !strings.HasPrefix(result, "Error:") {
                    fmt.Printf("  Result: %s\n", result)
                }
            }
            // Loop again so the AI can use the tool results
            continue
        }

        // AI gave a final answer
        if assistantMessage.Content != "" {
            fmt.Printf("\n🤖 Agent: %s\n", assistantMessage.Content)
            a.messages = append(a.messages, assistantMessage)
        }
        break
    }
    return nil
}

What I find elegant about this design is its simplicity. The agent doesn’t need to understand what each tool does; it simply trusts the AI to make good decisions and executes whatever is requested.
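One small helper that processWithAI relies on but isn’t shown above is getToolSchemas. All it needs to do is collect each registered tool’s schema so it can be attached to every request - something like this sketch:

// getToolSchemas gathers the schema of every registered tool so the AI knows
// what it's allowed to call.
func (a *Agent) getToolSchemas() []ToolSchema {
    schemas := make([]ToolSchema, 0, len(a.tools))
    for _, tool := range a.tools {
        schemas = append(schemas, tool.Schema())
    }
    return schemas
}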

Error Handling

I learned to handle errors gracefully after watching too many early agents crash on simple mistakes (that were mostly tiny Gemma’s fault lol):

func (a *Agent) executeToolCall(toolCall ToolCall) string {
    tool, exists := a.tools[toolCall.Function.Name]
    if !exists {
        availableTools := make([]string, 0, len(a.tools))
        for name := range a.tools {
            availableTools = append(availableTools, name)
        }
        return fmt.Sprintf("Error: Unknown tool '%s'. Available: %v",
            toolCall.Function.Name, availableTools)
    }

    result, err := tool.Execute(json.RawMessage(toolCall.Function.Arguments))
    if err != nil {
        return fmt.Sprintf("Error executing %s: %v", toolCall.Function.Name, err)
    }
    return result
}
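The other piece main.go (below) expects is agent.Run(), the interactive loop that reads your input and hands it to processWithAI. The post doesn’t reprint it, but a minimal version might look like this sketch (the banner text is just what you’ll see in the sessions later on):

// Run is the interactive loop: read a line from the user, hand it to the AI,
// repeat until the user types "exit".
func (a *Agent) Run() error {
    fmt.Println("🤖 Local AI Agent (Powered by Gemma 3)")
    fmt.Println("Available tools: read_file, list_files, edit_file, create_file")
    fmt.Println("Type 'exit' to quit")
    fmt.Println(strings.Repeat("-", 50))

    scanner := bufio.NewScanner(os.Stdin)
    for {
        fmt.Print("\n👤 You: ")
        if !scanner.Scan() {
            break
        }
        input := strings.TrimSpace(scanner.Text())
        if input == "" {
            continue
        }
        if strings.EqualFold(input, "exit") {
            fmt.Println("👋 Goodbye!")
            break
        }

        // Add the user's message to the history and let the AI take it from there.
        a.messages = append(a.messages, Message{Role: "user", Content: input})
        if err := a.processWithAI(); err != nil {
            fmt.Printf("⚠️  %v\n", err)
        }
    }
    return scanner.Err()
}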

Bringing It All Together

The main function is super simple:

Create main.go:

package main

import (
    "flag"
    "fmt"
    "log"
)

func main() {
    // Allow customization via command line
    serverURL := flag.String("server", "http://127.0.0.1:1234", "LM Studio server URL")
    model := flag.String("model", "qwen3-4b-thinking-2507-mlx", "Model to use")
    flag.Parse()

    // Connect everything
    client := NewLMStudioClient(*serverURL, *model)
    agent := NewAgent(client)

    fmt.Printf("Connecting to %s using model %s\n\n", *serverURL, *model)
    if err := agent.Run(); err != nil {
        log.Fatal(err)
    }
}

Building and Testing

Time to see our creation in action:

Terminal window
go build -o agent
./agent

You should see:

Connecting to http://127.0.0.1:1234 using model qwen3-4b-thinking-2507-mlx
🤖 Local AI Agent (Powered by Gemma 3)
Available tools: read_file, list_files, edit_file, create_file
Type 'exit' to quit
--------------------------------------------------
👤 You:

Let’s verify that our setup works with a quick API test:

Terminal window
curl -s -X POST http://127.0.0.1:1234/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen3-4b-thinking-2507-mlx",
    "messages": [
      {"role": "user", "content": "List the files in the test_files directory"}
    ],
    "tools": [{
      "type": "function",
      "function": {
        "name": "list_files",
        "description": "List files in a directory",
        "parameters": {
          "type": "object",
          "properties": {
            "path": {"type": "string", "description": "Directory path"}
          },
          "required": ["path"]
        }
      }
    }],
    "tool_choice": "auto"
  }'

If you see a tool call in the response, you’re ready to go!

Seeing It In Action

Let me walk you through what it feels like to use the agent. When you type a message, here’s what happens behind the scenes:

  1. Your message gets added to the conversation history
  2. The full conversation + available tools are sent to the AI
  3. The AI thinks and decides whether it needs to use tools
  4. If tools are needed, the agent executes them and feeds results back
  5. The AI formulates a response using the tool results
  6. You see the final answer

Here’s a real session that shows this flow:

Terminal window
$ ./agent
Connecting to http://127.0.0.1:1234 using model qwen3-4b-thinking-2507-mlx
🤖 Local AI Agent (Powered by Gemma 3)
Available tools: read_file, list_files, edit_file, create_file
Type 'exit' to quit
--------------------------------------------------
👤 You: list files in test_files
🔧 Using tools...
Model requested 1 tool(s)
list_files executed
Result: 📄 hello.txt
📄 data.json
🤖 Agent: The files in the test_files directory are:
- hello.txt
- data.json
👤 You: read hello.txt from test_files
🔧 Using tools...
Model requested 1 tool(s)
read_file executed
Result: Hello from the test file!
This is a sample file for testing our local AI agent.
It contains multiple lines of text.
Feel free to edit me!
🤖 Agent: The contents of `test_files/hello.txt` are:
Hello from the test file!
This is a sample file for testing our local AI agent.
It contains multiple lines of text.
Feel free to edit me!

Model Comparison: What I’ve Learned

After testing various models, I’ve discovered some interesting patterns that might help you choose the right one for your needs.

Gemma 3 270M: The Flash

Gemma 3 works and is very fast, but it can sometimes misunderstand instructions. It’s great for prototyping because:

  • Lightning fast responses (under 200ms typically) - Super quick to first token too!
  • Supports function calling (yes, even at 270M parameters!) - Insane how powerful small AI models are right now
  • Low resource usage (runs on modest hardware) - the 8-bit version I use needs about 1GB of RAM at a high context window

But it has quirks:

  • Sometimes writes Python code instead of using tools as needed
  • May use different parameter names than expected - and sometimes it hallucinates random parameter names (this is genuinely quite annoying, but could potentially be fixed with strict custom rules; I did not do any of that here)
  • Best with very clear, direct prompts

Qwen 3 4B: The Thinker

Qwen 3 4B has become my go-to for serious work. It’s slower (because of its lengthy thinking process) but dramatically better.

Here’s a real comparison using the same vague prompt: “edit the file in test_files and do what you wish to them”

Qwen 3 4B Response:

🔧 Using tools...
Model requested 1 tool(s)
✓ list_files executed
Result: 📄 comparison_test.py
📄 data.json
📄 hello.txt
🔧 Using tools...
Model requested 1 tool(s)
✓ edit_file executed
Result: File edited successfully
🤖 Agent: <think>
The user said "edit the file in test_files and do what you wish".
First, I needed to see what files are available, so I listed them.
I see hello.txt is a good candidate for editing. I'll update it
to say "Hello World!" instead of just "Hello".
</think>
The file `hello.txt` in `test_files` has been successfully edited
to contain: **Hello World!**

Gemma 3 270M Response:

import os

def edit_files():
    try:
        for filename in os.listdir("test_files"):
            if filename.endswith(".txt"):
                print(f"Editing {filename}")
    except Exception as e:
        print(f"Error: {e}")

The difference is stark. Qwen:

  • Lists files first to understand what’s available
  • Makes intelligent decisions about which file to edit
  • Shows its reasoning process in <think> blocks
  • Actually performs the action rather than generating code
  • Communicates clearly about what it did

Complete Qwen Interaction Example

Here’s a full session that showcases Qwen’s impressive reasoning abilities:

Terminal window
$ ./agent
Connecting to http://127.0.0.1:1234 using model qwen3-4b-thinking-2507-mlx
🤖 Local AI Agent (Powered by Gemma 3)
Available tools: read_file, list_files, edit_file, create_file
Type 'exit' to quit
--------------------------------------------------
👤 You: edit the file in test_files and do what you wish to them
🔧 Using tools...
Model requested 1 tool(s)
list_files executed
Result: 📄 comparison_test.py
📄 data.json
📄 hello.txt
🔧 Using tools...
Model requested 1 tool(s)
edit_file executed
Result: File edited successfully
🤖 Agent: <think>
Okay, let me go through the user's request step by step. The user initially said, "edit the file in test_files and do what you wish to them".
First, I needed to figure out which file to edit. Since they didn't specify a particular file, I used the list_files function on "test_files" to see what's there. The response showed four files: comparison_test.py, data.json, hello.txt.
Next, I had to choose which one to edit. The user said "do what you wish", so I picked hello.txt as it's a straightforward text file. To make the edit, I thought about replacing "Hello" with "Hello World!".
But wait, before making the edit, maybe I should check what's in hello.txt. However, since the user didn't specify any existing content and said "do what you wish", I proceeded with a simple replacement. The tool call for edit_file was made, and the response came back as "File edited successfully".
Now, I need to confirm that the edit was successful. The user probably expects a confirmation message. Since the tool response says it's done, I should let them know that hello.txt has been updated. No further tool calls are needed here because the edit succeeded.
I should phrase it clearly: "The file hello.txt in test_files has been successfully edited to contain 'Hello World!'." That's straightforward and addresses the user's request without overcomplicating.
</think>
The file `hello.txt` in `test_files` has been successfully edited to contain:
**Hello World!**
👤 You: exit
👋 Goodbye!

What Makes This Impressive:

  1. Autonomous Problem Solving - Didn’t know which file to edit, so listed them first
  2. Logical Decision Making - Chose the most appropriate file to modify
  3. Transparent Reasoning - Shows complete thought process
  4. Tool Chaining - Used multiple tools in sequence naturally
  5. Natural Communication - Provides clear, human-friendly feedback

Model Recommendations

| Model | Speed | Quality | Best For |
| --- | --- | --- | --- |
| Gemma 3 270M | Very Fast | Unreliable | Prototyping, testing |
| Qwen3 4B Thinking | Slower | Excellent | Production, complex tasks |
| GPT-OSS 20B | Slow | Excellent | Heavy-duty work |

For learning and experimentation, start with Gemma. For anything serious, use Qwen 3 4B or larger.

Troubleshooting: What I’ve Learned the Hard Way

Building local agents means debugging everything yourself. Here’s my systematic approach when things go wrong:

Connection Issues

“Connection refused” errors:

Terminal window
# Test if LM Studio is responding
curl http://127.0.0.1:1234/v1/models

If this fails, check that LM Studio’s local server is actually running.

“Model not found” errors: The model name in your code must exactly match what’s loaded in LM Studio. Check the server tab for the exact name.

Tool Issues

Tools not being called:

  • Try more explicit prompts: “use the list_files tool to show me files in the documents folder”
  • Add debug output to see what the AI is actually deciding to do (see the snippet after this list)
  • Smaller models need clearer instructions
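For the debug-output suggestion above, a couple of temporary print statements inside processWithAI, right after assistantMessage is assigned, go a long way (a hypothetical snippet):

// Temporary debug lines: dump the model's raw decision before acting on it.
fmt.Printf("[debug] content: %q\n", assistantMessage.Content)
for _, tc := range assistantMessage.ToolCalls {
    fmt.Printf("[debug] tool call: %s(%s)\n", tc.Function.Name, tc.Function.Arguments)
}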

Parameter mismatches: Add logging to see what parameters the model is sending:

fmt.Printf("Tool received: %s\n", string(args))

Then add handling for whatever parameter names you see.

Performance Issues

Slow responses with larger models:

  • Conversation history grows quite fast; consider pruning old messages (see the sketch at the end of this section)
  • Tool schemas are sent with every request, which adds overhead and can be quite annoying
  • Larger models simply take more time to think (especially locally, where inference may already be slower than what third-party providers like Cerebras offer)

Memory usage: Go’s garbage collector usually handles this well, but keep an eye on conversation history size if you’re having long sessions.
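For the history-pruning suggestion above, the simplest approach is to cap the slice at the most recent N messages after each turn. A rough sketch, not in the original code - in practice you’d also want to avoid separating a tool result from the assistant message that requested it:

// pruneHistory keeps only the most recent max messages so long sessions don't
// balloon the request payload (and memory).
func (a *Agent) pruneHistory(max int) {
    if len(a.messages) > max {
        a.messages = a.messages[len(a.messages)-max:]
    }
}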

Final Thoughts

What surprised me most about building local agents was how approachable it turned out to be. No complex frameworks, no cloud dependencies, no API keys to manage. Just a conversation loop, some tools, and a local AI model. That guide from Ampcode was quite legit too; it gave me a great place to start.

The gap between a simple chatbot and a capable agent is smaller than I expected. Give an AI the ability to take actions in the real world, and suddenly it becomes incredibly useful for everything from basic to complex tasks.

Your local machine is more powerful than you probably realize. Modern laptops can run models that would have been considered large-scale research projects just a few years ago. Take advantage of it for prototyping and research as much as you can.

What You’ve Learned

By building this agent, you’ve learned:

  • How AI agents actually work under the hood
  • The mechanics of tool calling and function execution
  • HTTP API integration without complex SDKs
  • Go programming patterns for building reliable systems
  • How to handle model variations and edge cases

Your Next Steps

Start simple and build up:

Sample projects:

  • Add a delete_file tool (follow the same pattern as create_file)
  • Add colored terminal output for better readability
  • Implement web search using a search API
  • Create a shell command execution tool
  • Create specialized agents for specific domains
  • Add retrieval-augmented generation for long documents

The foundation you’ve built here can grow into almost anything. Local agents are pretty cool and they might be the future of how we interact with AI, especially for those who value their security and privacy.


The complete source code is available in this GitHub repository. Try it yourself, break it, fix it, and build something amazing on top of this foundation. The beauty of local agents is that you own the entire stack.