AI Devtools Landscape

Explore 1 new tool each week to elevate your AI-native developer workflow

Trending

These tools are popping off right now. Try one of these in your dev workflow.

Gemini Deep Research Agent

AI Agents

Open research agent designed for developers

Wave Browser

Browser

Browse the internet and contribute to ocean conservation.

Dvina

Integrations/Gateway

Connects apps and databases using private AI

A2UI by Google

Design

Prototyping

AI-powered tool for building user interfaces

Vibe Pocket

AI Agents

Run AI agents on mobile devices efficiently

Aident AI

No code

Build automations using natural language easily.

AI-Native Directory

Discover the full directory of AI-driven dev tools to elevate your workflow!

Editor’s Picks

The tools we recommend you experiment with this month.

Code Wiki

DevOps

AI-driven wiki generator for code repositories.

Conductor

Development Environment

AI coding agents orchestration with visual dashboard.

Google Antigravity

Development Environment

AI-powered development platform by Google.

Graphite

DevOps

AI code review for faster, quality shipping.

Letta Context-Bench

Testing & Quality

LLM benchmark for context engineering.

LocalAI

Infrastructure

Self-hosted AI engine for local LLMs.

Leaderboards

Stop guessing. See which agents and models actually perform.

Autonomous Agent Leaderboard

Real-world task completion by autonomous AI agents

Rank	Agent + Model	Accuracy %
1	Droid - GPT-5.2	64.90
2	Junie CLI - Gemini 3 Flash	64.30
3	Droid - Claude Opus 4.5	63.10
4	II-Agent - Gemini 3 Pro	61.80
5	Warp - Multiple	61.20
6	Droid - Gemini 3 Pro	61.10
7	Codex CLI - GPT-5.1-Codex-Max	60.40
8	Letta Code - Claude Opus 4.5	59.10
9	Warp - Multiple	59.10
10	Abacus AI Desktop - Multiple	58.40

Results from tbench, access the full leaderboard here • Last updated Dec 29, 2025

Top Coding Models Leaderboard

Code generation and bug-fixing capabilities on real GitHub issues

Rank	Model	Resolved %
1	Claude 4.5 Opus medium (20251101)	74.40
2	Gemini 3 Pro Preview (2025-11-18)	74.20
3	GPT-5.2 (2025-12-11) (high reasoning)	71.80
4	Claude 4.5 Sonnet (20250929)	70.60
5	GPT-5.2 (2025-12-11)	69.00
6	Claude 4 Opus (20250514)	67.60
7	GPT-5.1-codex (medium reasoning)	66.00
8	GPT-5.1 (2025-11-13) (medium reasoning)	66.00
9	GPT-5 (2025-08-07) (medium reasoning)	65.00
10	Claude 4 Sonnet (20250514)	64.93

Results from swebench, access the full leaderboard here • Last updated Dec 29, 2025

The AI Devtools Landscape | AI Native Dev