ai-models

Gemini 3.1 Pro — Competing for Top Spot in Coding & Reasoning Performance

AI Tools Hub

2026年3月6日

#Gemini #Google #AI #code generation #reasoning

Google's Gemini 3.1 Pro, released in February 2026, achieves 80.6% on SWE-Bench Verified and 2887 Elo on LiveCodeBench, positioning it as a top-tier reasoning model alongside Claude Opus 4.6.

Gemini 3.1 Pro — Competing for Top Spot in Coding & Reasoning Performance

Google released Gemini 3.1 Pro on February 19, 2026. As a top-tier reasoning model alongside Claude Opus 4.6, it has achieved outstanding results in coding and software engineering benchmarks.

Key Features

Multimodal x 1 Million Token Context

Can process text, images, audio, video, PDFs, and even entire code repositories simultaneously. The 1 million token context window enables large-scale information analysis.

Advanced Reasoning & Agentic Capabilities

Equipped with complex problem-solving abilities, capable of planning, executing, monitoring, and adapting multi-step tasks. Optimized for finance, spreadsheet applications, and document analysis.

Efficiency & Thinking Options

Improved token efficiency with a new “MEDIUM” thinking level parameter, allowing users to balance cost, performance, and speed.

Google Ecosystem Integration

Google Workspace: Integrated into Gmail, Docs, Sheets, Slides, Meet
Nano Banana 2: Generates studio-quality images from prompts (released February 26, 2026)
Veo 3.1: Creates 8-second videos at 720p/1080p
NotebookLM: Generates podcasts from documents
Personal Intelligence: Connects to Gmail, Photos, Drive, Search for personalized answers

Coding-Specific Features

Custom Tools Endpoint: gemini-3.1-pro-preview-customtools prioritizes custom tools like view_file and search_code in agentic workflows
Nano Banana Image Generation: Directly generates animated SVGs from text

Benchmark Performance

Benchmark	Gemini 3.1 Pro	Claude Opus 4.6	GPT-5.2
LiveCodeBench (Elo)	2887	-	2393
SWE-Bench Verified	80.6%	80.8%	80.0%
SciCode	59%	52%	-
Terminal-Bench 2.0	68.5%	65.4%	54.0%
SWE-Bench Pro	54.2%	-	56.8%

Highlights:

Leads 13 out of 16 benchmarks
More than doubled performance on ARC-AGI-2 compared to previous model
Resolved output truncation issues

Future Outlook

While Gemini 3.1 Pro is the current leading model, the next generation (Gemini 4?) is expected in late 2026, with further improvements in context window size, multimodal understanding, and reasoning capabilities.

Conclusion

Gemini 3.1 Pro represents Google’s most significant AI update, establishing itself as a top-tier model alongside Claude Opus 4.6. Its performance in coding and software engineering is particularly noteworthy, making it a strong option for developers.