shell-agent

A macOS GUI chat and agent tool powered by local LLM.

Features

Multi-turn chat with OpenAI-compatible API (LM Studio)
MCP support via mcp-guardian (multiple servers, stdio proxy)
Shell script Tool Calling with MITL (Man-In-The-Loop) approval for write/execute operations
Timestamp-aware memory — Hot/Warm/Cold sliding window with LLM summarization
Pinned Memory — autonomous extraction and retention of important facts across sessions
Multimodal — image input via drag & drop, paste, or file picker with smart image recall
Markdown rendering with syntax highlighting (GFM, code blocks, tables)
Menu bar launcher (SwiftUI) with global hotkey (Ctrl+Shift+Space)
Security — nlk/guard (prompt injection defense), nlk/jsonfix (JSON repair), nlk/strip (thinking tag removal)
Data analysis — embedded DuckDB for CSV/JSON/JSONL, natural language SQL queries, sliding window summarization, background analysis
Color themes — Dark, Light (cream + blue), Warm (brown), Midnight (navy) with live preview
Settings UI — in-app configuration for API, memory, tools, MCP guardians, theme, and startup mode
Session management — auto-generated titles, rename, delete with confirmation
Startup mode — configurable: new chat or resume last session
Window state persistence — position and size remembered across launches

Architecture

shell-agent/
├── app/          # Wails v2 + React main application (Go backend)
├── launcher/     # SwiftUI menu bar launcher (macOS native)
└── docs/         # Documentation and RFP

Go Backend Packages

Package	Purpose
`internal/chat`	Chat engine, time injection, message building
`internal/client`	OpenAI-compatible API client (streaming + non-streaming, multimodal)
`internal/config`	JSON config with path expansion (~, $ENV)
`internal/mcp`	mcp-guardian stdio child process management
`internal/memory`	Hot/Warm/Cold tiers, pinned memory, image store, session persistence
`internal/objstore`	Central object repository for images, blobs, and reports
`internal/analysis`	DuckDB analysis engine, SQL generation, sliding window summarizer
`internal/toolcall`	Shell script tool registry, header parsing, MITL categories

Requirements

macOS 14+ (launcher), macOS 10.15+ (main app)
LM Studio (or any OpenAI-compatible API server)
Apple Silicon M1/M2 Pro+ recommended (for gemma-4-26b-a4b)

Build

# Main app
cd app
make build

# Launcher
cd launcher/ShellAgentLauncher
swift build

Development

cd app
make dev    # Hot reload with Wails dev server

Tool Scripts

Place shell scripts in ~/Library/Application Support/shell-agent/tools/ with header annotations:

#!/bin/bash
# @tool: list-files
# @description: List files in a directory
# @param: path string "Directory path to list"
# @category: read

Categories: read (auto-execute), write / execute (MITL approval required)

Security: Arguments are passed as JSON via stdin (not command-line arguments) to prevent shell injection. Tool scripts must parse JSON safely (e.g., jq) and never pass parsed values directly to eval or unquoted shell expansion.

MCP Configuration

Add MCP servers via Settings UI or config.json:

{
  "guardians": [
    {
      "name": "filesystem",
      "binary_path": "~/.local/bin/mcp-guardian",
      "profile_path": "~/.config/mcp-guardian/profiles/filesystem.json"
    }
  ]
}

Data Analysis

Load data files and analyze them using natural language or SQL:

User: Load /path/to/sales.csv and show total revenue by region
Agent: [load-data] → [query-preview] → Tokyo: ¥2,024,500, Osaka: ¥918,000, ...

Analysis Tools

Tool	Description
`load-data`	Load CSV/JSON/JSONL into DuckDB
`describe-data`	Show/annotate table schemas
`query-preview`	Natural language → SQL → results
`query-sql`	Execute SQL directly
`suggest-analysis`	LLM suggests analysis perspectives
`quick-summary`	Query + LLM summarization
`analyze-bg`	Background analysis (survives app close)
`analysis-status`	Check background job progress
`analysis-result`	Retrieve completed report
`reset-analysis`	Clear all loaded tables

Background Analysis

For large datasets, analyze-bg spawns a detached process that continues even after Shell Agent is closed:

User: Analyze access logs for security threats
Agent: [analyze-bg] → Job started: job-1713488400000
       ...later...
User: Check analysis status
Agent: [analysis-status] → done (4 windows, 5 findings)
User: Show the report
Agent: [analysis-result] → Security Threat Analysis report

Default Model

google/gemma-4-26b-a4b

Configuration

Settings are stored in ~/Library/Application Support/shell-agent/config.json. Configurable via the in-app Settings panel.

License

MIT License - see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
app		app
docs		docs
launcher/ShellAgentLauncher		launcher/ShellAgentLauncher
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.ja.md		README.ja.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

shell-agent

Features

Architecture

Go Backend Packages

Requirements

Build

Development

Tool Scripts

MCP Configuration

Data Analysis

Analysis Tools

Background Analysis

Default Model

Configuration

License

About

Uh oh!

Releases 11

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

shell-agent

Features

Architecture

Go Backend Packages

Requirements

Build

Development

Tool Scripts

MCP Configuration

Data Analysis

Analysis Tools

Background Analysis

Default Model

Configuration

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 11

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages