TL;DR

Semble is a code search tool designed for agents that achieves 98% fewer tokens than grep+read, enabling faster and more efficient code retrieval. It indexes repos rapidly and runs entirely on CPU, with no external dependencies. This development could significantly improve code search workflows for AI agents.

Semble, a new code search library tailored for AI agents, claims to cut token usage by approximately 98% compared to traditional grep+read methods, while maintaining high accuracy and speed. It runs entirely on CPU, requires no external services, and can be integrated with popular agents like Claude Code, Codex, and OpenCode, offering instant code retrieval.

Developed for use with agents, Semble indexes a full codebase in about 250 milliseconds and answers queries in roughly 1.5 milliseconds. Benchmarks indicate it achieves a normalized discounted cumulative gain (NDCG@10) of 0.854, comparable to specialized transformer models but at a fraction of their size and cost. The tool can be deployed as an MCP server or used directly via command line, supporting local repositories or remote git URLs.

Semble’s core advantage is its token efficiency, returning only the relevant code snippets with significantly fewer tokens—around 98% less—than traditional grep-based methods. It operates solely on CPU, eliminating the need for GPUs or API keys, which simplifies setup and reduces costs. It also offers features like automatic re-indexing on file changes and caching for session persistence.

Why It Matters

This development matters because it offers a more efficient, faster, and cost-effective way for AI agents to perform code searches, which can enhance automation, debugging, and development workflows. By reducing token consumption and latency, Semble can improve the responsiveness and scalability of code retrieval systems, especially in large codebases.

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light

【Vehicle CEL Doctor】The NT301 obd2 scanner enables you to read DTCs, access to e-missions readiness status, turn off…

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background

Traditional code search methods like grep read entire files and can be token-heavy, limiting efficiency in AI-driven workflows. Existing transformer-based models for code retrieval are accurate but costly and slow, often requiring specialized hardware. Semble’s approach, which emphasizes speed, token efficiency, and local CPU operation, addresses these limitations. Its announcement on Hacker News highlights ongoing efforts to optimize developer tools for AI integration, with similar tools gaining interest in the developer community.

“Semble returns only the relevant chunks, using ~98% fewer tokens than grep+read, with comparable retrieval quality.”

— Semble team

“This could be a game-changer for AI code agents, making code search faster and cheaper.”

— Hacker News user

Inateck 2D Barcode Scanner, Wireless Bluetooth QR Code Scanner with AI APP & SDK, 180-Day Battery Life, Fast & Accurate Scanning, Compatible with iOS/Android/Windows

Inateck 2D Barcode Scanner, Wireless Bluetooth QR Code Scanner with AI APP & SDK, 180-Day Battery Life, Fast & Accurate Scanning, Compatible with iOS/Android/Windows

Powerful Scanning Capability: The Inateck 2D barcode scanner accurately reads almost all 1D and 2D barcodes within a…

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

What Remains Unclear

Details about long-term scalability, integration with all agent types, and real-world performance in large, complex codebases remain to be fully validated. It is also unclear how Semble performs across different programming languages and in diverse development environments.

DeskFX Free Audio Effects & Audio Enhancer Software [PC Download]

DeskFX Free Audio Effects & Audio Enhancer Software [PC Download]

Transform audio playing via your speakers and headphones

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

What’s Next

Next steps include broader adoption by AI developer communities, further benchmarking in varied environments, and potential integration into more agent frameworks. Updates may include feature enhancements, support for additional languages, and performance improvements based on user feedback.

Code by Note, Bk 1: Find the Patterns by Reading the Notes, Coloring Book (Color by Note, Bk 1)

Code by Note, Bk 1: Find the Patterns by Reading the Notes, Coloring Book (Color by Note, Bk 1)

Used Book in Good Condition

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

How does Semble compare to traditional grep in terms of speed?

Semble indexes repositories in about 250 ms and answers queries in roughly 1.5 ms, making it significantly faster than grep+read, especially for large codebases.

Can Semble be used with existing AI code agents?

Yes, Semble supports integration with agents like Claude Code, Codex, and OpenCode via MCP or command line, enabling instant code search capabilities.

Is Semble suitable for large, multi-language repositories?

While designed for efficiency, performance in large or multi-language repos depends on indexing and setup. Benchmarks suggest good speed and accuracy, but real-world results may vary.

What are the setup requirements for using Semble?

Semble runs on CPU, requires Python and uv, and can be integrated via MCP or CLI. No GPU, API keys, or external services are necessary.

You May Also Like

An AI Hate Wave Is Here

Recent reports indicate a surge in online hostility towards AI systems, raising concerns about societal impacts and AI development.

OpenAI Campus Network: Student club interest form

OpenAI has introduced a new student club interest form for its Campus Network, inviting students to join and participate in AI-focused activities.

For Most Millennials, Generative AI Is the Key to Efficiency and Balance.

When most millennials harness generative AI, they unlock new levels of efficiency and balance—discover how it can transform your routine today.

AI Is Shrinking Some Job Descriptions and Expanding Others

Keenly understanding AI’s impact on jobs reveals opportunities and challenges that can shape your future career path—discover what lies ahead.