TL;DR

Semble is a code search tool designed for agents that reportedly uses about 98% fewer tokens than grep+read, making code retrieval faster and cheaper. It indexes repos rapidly and runs entirely on CPU, with no GPU, API keys, or external services. If the claims hold up, it could significantly improve code search workflows for AI agents.

Semble, a new code search library tailored for AI agents, claims to cut token usage by approximately 98% compared to traditional grep+read methods while maintaining comparable retrieval quality and high speed. It runs entirely on CPU, requires no external services, and can be integrated with popular agents such as Claude Code, Codex, and OpenCode.

According to its published figures, Semble indexes a full codebase in about 250 milliseconds and answers queries in roughly 1.5 milliseconds. Benchmarks indicate a normalized discounted cumulative gain at rank 10 (NDCG@10) of 0.854, comparable to specialized transformer models but at a fraction of their size and cost. The tool can be deployed as an MCP server or used directly from the command line, and it supports both local repositories and remote git URLs.
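For readers unfamiliar with the metric, NDCG@10 scores a ranked result list by how close it comes to the ideal ordering of the same results. The following is a minimal sketch of the standard formula, not Semble's benchmark harness: the function names are hypothetical, the gain is assumed to be linear in the relevance label, and the ideal ranking is approximated by sorting the retrieved labels rather than drawing on every relevant item in the corpus.

```python
import math


def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the top-k results.

    `relevances` holds graded relevance labels in ranked order.
    A result at 0-based position i is discounted by log2(i + 2).
    """
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances, k=10):
    """DCG normalized by the DCG of the best possible ordering.

    Assumption: the ideal ordering is built from the same labels,
    which is a simplification of a full benchmark setup.
    """
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


# Hypothetical labels: the single relevant chunk was ranked second.
score = ndcg_at_k([0, 1, 0, 0])
print(f"NDCG@10 = {score:.3f}")
```

A score of 1.0 means the most relevant chunks were ranked first; lower scores mean relevant chunks appeared further down the list.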

Semble’s core advantage is its token efficiency: it returns only the relevant code snippets, using about 98% fewer tokens than grep-based methods that read whole files. It operates solely on CPU, eliminating the need for GPUs or API keys, which simplifies setup and reduces costs. It also offers automatic re-indexing on file changes and caching for session persistence.
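To put the 98% figure in concrete terms, the reduction is simply the share of baseline tokens that no longer reach the agent's context. The sketch below shows the arithmetic only; the token counts are hypothetical values chosen for illustration, not measurements from Semble's benchmarks, and `token_reduction` is a made-up helper name.

```python
def token_reduction(baseline_tokens, retrieved_tokens):
    """Fraction of tokens saved relative to a grep+read baseline."""
    if baseline_tokens <= 0:
        raise ValueError("baseline_tokens must be positive")
    return (baseline_tokens - retrieved_tokens) / baseline_tokens


# Hypothetical example: grep+read pulls 50,000 tokens of whole files
# into context, while chunk-level retrieval returns 1,000 tokens.
saved = token_reduction(baseline_tokens=50_000, retrieved_tokens=1_000)
print(f"{saved:.0%} fewer tokens")
```

Under these assumed numbers, every query that would have cost 50,000 context tokens costs 1,000 instead, which is where savings on long agent sessions would come from.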

Why It Matters

Semble offers AI agents a faster and cheaper way to search code, which can benefit automation, debugging, and development workflows. Lower token consumption and latency make code retrieval more responsive and easier to scale, especially in large codebases.

Background

Traditional code search workflows, in which an agent runs grep and then reads entire files, are token-heavy and limit efficiency in AI-driven work. Existing transformer-based models for code retrieval are accurate but costly and slow, often requiring specialized hardware. Semble’s approach, which emphasizes speed, token efficiency, and local CPU operation, addresses these limitations. Its announcement on Hacker News reflects ongoing efforts to optimize developer tools for AI integration, with similar tools gaining interest in the developer community.

“Semble returns only the relevant chunks, using ~98% fewer tokens than grep+read, with comparable retrieval quality.”

— Semble team

“This could be a game-changer for AI code agents, making code search faster and cheaper.”

— Hacker News user

What Remains Unclear

Details about long-term scalability, integration with all agent types, and real-world performance in large, complex codebases remain to be fully validated. It is also unclear how Semble performs across different programming languages and in diverse development environments.

What’s Next

Next steps include broader adoption by AI developer communities, further benchmarking in varied environments, and potential integration into more agent frameworks. Updates may include feature enhancements, support for additional languages, and performance improvements based on user feedback.

Key Questions

How does Semble compare to traditional grep in terms of speed?

By its own reported figures, Semble indexes repositories in about 250 ms and answers queries in roughly 1.5 ms. Its main stated advantage over grep+read is returning only the relevant chunks rather than whole files, which matters most in large codebases.

Can Semble be used with existing AI code agents?

Yes, Semble supports integration with agents like Claude Code, Codex, and OpenCode via MCP or command line, enabling instant code search capabilities.

Is Semble suitable for large, multi-language repositories?

This has not been fully validated. The published benchmarks suggest good speed and accuracy, but performance in large or multi-language repositories has yet to be demonstrated, and real-world results may vary.

What are the setup requirements for using Semble?

Semble runs on CPU, requires Python and uv, and can be integrated via MCP or CLI. No GPU, API keys, or external services are necessary.
