Programming Language Benchmarks

Qwen3-Coder-Next offers vibe coders a powerful open source, ultra-sparse model with 10x higher throughput for repo tasks

On SWE-Bench Verified, the model achieved a score of 70.6%. This performance is notably competitive when placed alongside significantly larger models; it outpaces DeepSeek-V3.2, which scores 70.2%, ...

The Lancet

CARDBiomedBench: a benchmark for evaluating the performance of large language models in biomedical research

Although large language models (LLMs) have the potential to transform biomedical research, their ability to reason accurately across complex, data-rich domains remains unproven. To address this ...

Quesma Releases OTelBench: Independent Benchmark Reveals Frontier LLMs Struggle with Real-World SRE Tasks

New benchmark shows top LLMs achieve only 29% pass rate on OpenTelemetry instrumentation, exposing the gap between ...

TechAnnouncer

Beyond LeetCode: Top Reddit Recommendations for Coding Practice Alternatives

Built-in IDE: Code directly in the browser without needing to set up a local environment.

WinBuzzer

AI Coding: Microsoft’s 7B X-Coder Outperforms 14B Rivals on Synthetic Data

Microsoft and Tsinghua University have developed a 7B-parameter AI coding model that outperforms 14B rivals using only ...

Automation World

Vibe Coding: How Natural Language Programming is Changing Manufacturing Software Development

Vibe coding allows manufacturing personnel to create software using everyday speech instead of traditional programming, enabling production managers to simply say "build a monitoring dashboard for ...

InfoQ

FACTS Benchmark Suite Introduced to Evaluate Factual Accuracy of Large Language Models

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

InfoWorld

8 old programming languages developers won’t quit

Newer languages might soak up all the glory, but these die-hard languages have their place. Here are eight languages developers still use daily, and what they’re good for. The computer revolution has ...

InfoWorld

R language is making a comeback – Tiobe

The R language for statistical computing has creeped back into the top 10 in Tiobe’s monthly index of programming language popularity. “Programming language R is known for fitting statisticians and ...

Wired

Ruby Is Not a Serious Programming Language

My little theory is that the concept of “imprinting” in psychology can just as easily be applied to programming: Much as a baby goose decides that the first moving life-form it encounters is its ...

TechCrunch

Google launches Gemini 3 with new coding app and record benchmark scores

On Tuesday, Google released Gemini 3, its latest and most advanced foundation model, which is now immediately available through the Gemini app and AI search interface. Coming just seven months after ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results