News & analysis, rated.
Breaking AI developments, in-depth guides, real-world case studies, and analysis — each one rated so you know what matters.
OpenAI's LifeSciBench grades AI on 750 real research tasks
OpenAI released LifeSciBench, a 750-task benchmark using expert-written rubrics to evaluate how well AI models perform on actual life-science research work. Here's what the benchmark measures and why it matters for biotech teams.
OpenAI Deploys GPT-4 and o1 with Partners via Dedicated Field Engineers
OpenAI is rolling out frontier models to enterprise customers through a network of field deployment engineers and strategic partners. Here's what's shipping now and what it means for your stack.
OpenAI releases LifeSciBench to test AI on biology tasks
OpenAI introduced LifeSciBench, a benchmark for evaluating frontier models on life science research problems. The tool measures performance across biology-focused tasks to help teams assess model capabilities in the field.
Anthropic Opens Seoul Office, Signs Korean AI Partnerships
Anthropic is establishing a physical presence in South Korea and partnering with local AI companies. The move signals expansion beyond the US market.
OpenAI releases LifeSciBench for evaluating AI on real research tasks
OpenAI introduced LifeSciBench, a benchmark authored and reviewed by life science experts to assess how AI systems perform on actual research decisions. Here's what it measures and why it matters for biotech teams.
OpenAI's GPT-5.4 improved a stubborn drug-making reaction with near-autonomous chemistry
OpenAI and Molecule.one deployed GPT-5.4 to optimize a key medicinal chemistry reaction. The system worked with minimal human direction—signaling a shift in how AI can assist drug discovery workflows.
Google's AMIE AI Matches Primary Care Doctors on Disease Management
A Nature study shows Google's medical AI system scored higher than human doctors on treatment plan precision and clinical guideline adherence. Google is now testing AMIE in real clinical settings.
Dark matter hunt hits dead end, physicists broaden search
After decades chasing WIMPs in liquid xenon detectors, physicists face the neutrino fog—background noise that may make traditional detection impossible. The search now splinters across axions, primordial black holes, and quantum sensors.
Solar geoengineering needs aircraft redesign, chemical formulas still unsolved
MIT researchers uncovered major engineering gaps in planetary cooling plans: stratospheric delivery requires planes unlike any flying today, and the best reflective chemical remains unknown. Here's what deployment would actually demand.
Neutrino fog forces dark matter hunt into new territory
Physicists chasing WIMPs hit a detection wall: solar neutrinos now drown out their signals. Researchers are pivoting to quantum sensors, liquid-helium detectors, and Jupiter's atmosphere to restart the search.
UK Councils Deploy Google AI to Cut Planning Application Wait Times 50%
The UK government is rolling out generative AI tools to all 300+ local planning authorities by 2027, automating routine applications such as loft conversions, which account for 70% of submissions. Here's what planners will actually gain.
Microsoft sells OpenAI models in China while rivals refuse
Microsoft has become the sole US distributor of OpenAI's GPT models to Chinese firms. ByteDance alone spends over $1B annually on Microsoft's AI services—a market OpenAI and Anthropic avoid entirely.