Latest AI News
- Switzerland releases 100% open AI modelby Muhammad Zulhusni on September 4, 2025 at 9:39 am
A group of Swiss institutions has released a new open AI model, designed to serve as a foundation for future research and applications. Built by EPFL, ETH Zurich, and the Swiss National Supercomputing Centre (CSCS), the model is called Apertus – Latin for “open.” The name reflects its core principle: every part of its design The post Switzerland releases 100% open AI model appeared first on AI News.
- From minutes to milliseconds: How CrateDB is tackling AI data infrastructureby TechForge on September 4, 2025 at 8:57 am
The promise of AI remains immense – but one thing might be holding it back. “The infrastructure that powers AI today won’t sustain tomorrow’s demands,” a recent CIO.com article leads. “CIOs must rethink how to scale smarter – not just bigger – or risk falling behind.” CrateDB agrees – and the database firm is betting The post From minutes to milliseconds: How CrateDB is tackling AI data infrastructure appeared first on AI News.
- Resham Kotecha, Open Data Institute: How the EU can lead in AIby Muhammad Zulhusni on September 4, 2025 at 8:42 am
The EU has a chance to shape how the world approaches AI and data governance. AI News spoke with Resham Kotecha, Global Head of Policy at the Open Data Institute (ODI), who said that opportunity lies in proving that protecting people’s rights and supporting innovation can go hand in hand. The ODI’s European Data and The post Resham Kotecha, Open Data Institute: How the EU can lead in AI appeared first on AI News.
- AI hacking tool exploits zero-day security vulnerabilities in minutesby Ryan Daws on September 3, 2025 at 9:57 am
A new AI tool – built to help companies find and fix their own security weaknesses – has been snatched up by cybercriminals, turned on its head, and used as a devastating hacking weapon exploiting zero-day vulnerabilities. According to a report from cybersecurity firm Check Point, the framework – called Hexstrike-AI – is the turning The post AI hacking tool exploits zero-day security vulnerabilities in minutes appeared first on AI News.
- Meta revises AI chatbot policies amid child safety concernsby Muhammad Zulhusni on September 3, 2025 at 8:39 am
Meta is revising how its AI chatbots interact with users after a series of reports exposed troubling behaviour, including interactions with minors. The company told TechCrunch it is now training its bots not to engage with teenagers on topics like self-harm, suicide, or eating disorders, and to avoid romantic banter. These are temporary steps while The post Meta revises AI chatbot policies amid child safety concerns appeared first on AI News.
- Microsoft gives free Copilot AI services to US government workersby Ryan Daws on September 2, 2025 at 2:22 pm
Millions of US federal government workers are about to get a new AI assistant on their devices for free in the form of Microsoft Copilot. The move is part of a deal between Microsoft and the US General Services Administration (GSA) that’s also expected to save taxpayers $3.1 billion in its first year. The centrepiece The post Microsoft gives free Copilot AI services to US government workers appeared first on AI News.
- Marketing AI boom faces crisis of consumer trustby Ryan Daws on August 29, 2025 at 12:19 pm
The vast majority (92%) of marketing professionals are using AI in their day-to-day operations, turning it from a buzzword into a workhorse. According to SAP Emarsys – which took the pulse of over 10,000 consumers and 1,250 marketers – while businesses are seeing real benefits from AI, shoppers are becoming increasingly distrustful, especially when it The post Marketing AI boom faces crisis of consumer trust appeared first on AI News.
- AI security wars: Can Google Cloud defend against tomorrow’s threats?by Dashveenjit Kaur on August 28, 2025 at 11:02 am
In Google’s sleek Singapore office at Block 80, Level 3, Mark Johnston stood before a room of technology journalists at 1:30 PM with a startling admission: after five decades of cybersecurity evolution, defenders are still losing the war. “In 69% of incidents in Japan and Asia Pacific, organisations were notified of their own breaches by The post AI security wars: Can Google Cloud defend against tomorrow’s threats? appeared first on AI News.
- Agentic AI: Promise, scepticism, and its meaning for Southeast Asiaby Muhammad Zulhusni on August 28, 2025 at 10:55 am
Agentic AI is being talked about as the next major wave of artificial intelligence, but its meaning for enterprises remains to be settled. Capgemini Research Institute estimates agentic AI could unlock as much as US$450 billion in economic value by 2028. Yet adoption is still limited: only 2% of organisations have scaled its use, and The post Agentic AI: Promise, scepticism, and its meaning for Southeast Asia appeared first on AI News.
- Tencent Hunyuan Video-Foley brings lifelike audio to AI videoby Ryan Daws on August 28, 2025 at 8:43 am
A team at Tencent’s Hunyuan lab has created a new AI, ‘Hunyuan Video-Foley,’ that finally brings lifelike audio to generated video. It’s designed to listen to videos and generate a high-quality soundtrack that’s perfectly in sync with the action on screen. Ever watched an AI-generated video and felt like something was missing? The visuals might The post Tencent Hunyuan Video-Foley brings lifelike audio to AI video appeared first on AI News.
- What Rollup News says about battling disinformationby TechForge on August 28, 2025 at 7:41 am
Swarm Network, a platform developing decentralised protocols for AI agents, recently announced the successful results of its first Swarm, a tool (perhaps “organism” is the better term) built to tackle disinformation. Called Rollup News, the swarm is not an app, a software platform, nor a centralised algorithm. It is a decentralised collection of AI agents The post What Rollup News says about battling disinformation appeared first on AI News.
- Google Vids gets AI avatars and image-to-video toolsby Ryan Daws on August 27, 2025 at 2:48 pm
Google is rolling out a raft of powerful new generative AI features for Vids designed to take the pain out of video creation. Between wrestling with complicated software, finding someone willing to be on camera, and then spending hours editing out all the “ums” and “ahs,” video production often feels more trouble than it’s worth. The post Google Vids gets AI avatars and image-to-video tools appeared first on AI News.
- From GPT-2 to gpt-oss: Analyzing the Architectural Advancesby Sebastian Raschka, PhD on August 9, 2025 at 11:23 am
And How They Stack Up Against Qwen3
- The Big LLM Architecture Comparisonby Sebastian Raschka, PhD on July 19, 2025 at 11:11 am
From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design
- Could AI slow science?by Sayash Kapoor on July 16, 2025 at 9:35 pm
Confronting the production-progress paradox
- LLM Research Papers: The 2025 List (January to June)by Sebastian Raschka, PhD on July 1, 2025 at 11:11 am
A topic-organized collection of 200+ LLM research papers from 2025
- Understanding and Coding the KV Cache in LLMs from Scratchby Sebastian Raschka, PhD on June 17, 2025 at 10:55 am
KV caches are one of the most critical techniques for efficient inference in LLMs in production.
- Coding LLMs from the Ground Up: A Complete Courseby Sebastian Raschka, PhD on May 10, 2025 at 11:03 am
Why build LLMs from scratch? It's probably the best and most efficient way to learn how LLMs really work. Plus, many readers have told me they had a lot of fun doing it.
- AGI is not a milestoneby Sayash Kapoor on May 1, 2025 at 11:47 am
There is no capability threshold that will lead to sudden impacts
- The State of Reinforcement Learning for LLM Reasoningby Sebastian Raschka, PhD on April 19, 2025 at 11:02 am
Understanding GRPO and New Insights from Reasoning Model Papers
- AI as Normal Technologyby Arvind Narayanan on April 15, 2025 at 2:53 pm
A new paper that we will expand into our next book
- First Look at Reasoning From Scratch: Chapter 1by Sebastian Raschka, PhD on March 29, 2025 at 11:11 am
Welcome to the next stage of large language models (LLMs): reasoning. LLMs have transformed how we process and generate text, but their success has been largely driven by statistical pattern recognition. However, new advances in reasoning methodologies now enable LLMs to tackle more complex tasks, such as solving logical puzzles or multi-step arithmetic. Understanding these methodologies is the central focus of this book.
- The State of LLM Reasoning Model Inferenceby Sebastian Raschka, PhD on March 8, 2025 at 12:11 pm
Inference-Time Compute Scaling Methods to Improve Reasoning Models
- Understanding Reasoning LLMsby Sebastian Raschka, PhD on February 5, 2025 at 12:11 pm
Methods and Strategies for Building and Refining Reasoning Models
- Noteworthy AI Research Papers of 2024 (Part Two)by Sebastian Raschka, PhD on January 15, 2025 at 12:11 pm
Six influential AI papers from July to December
- Noteworthy AI Research Papers of 2024 (Part One)by Sebastian Raschka, PhD on December 31, 2024 at 12:21 pm
Six influential AI papers from January to June
- Is AI progress slowing down?by Arvind Narayanan on December 18, 2024 at 4:47 pm
Making sense of recent technology trends and claims
- We Looked at 78 Election Deepfakes. Political Misinformation is not an AI Problem.by Sayash Kapoor on December 13, 2024 at 8:51 pm
Technology Isn’t the Problem—or the Solution.
- LLM Research Papers: The 2024 Listby Sebastian Raschka, PhD on December 8, 2024 at 12:11 pm
A curated list of interesting LLM-related research papers from 2024, shared for those looking for something to read over the holidays.
- Does the UK’s liver transplant matching algorithm systematically exclude younger patients?by Arvind Narayanan on November 11, 2024 at 7:57 pm
Seemingly minor technical decisions can have life-or-death effects
- Understanding Multimodal LLMsby Sebastian Raschka, PhD on November 3, 2024 at 12:44 pm
An introduction to the main techniques and latest models
- FAQ about the book and our writing processby Arvind Narayanan on October 4, 2024 at 3:55 pm
What's in the book and how we wrote it
- Building A GPT-Style LLM Classifier From Scratchby Sebastian Raschka, PhD on September 21, 2024 at 12:07 pm
Finetuning a GPT Model for Spam Classification
- Can AI automate computational reproducibility?by Sayash Kapoor on September 18, 2024 at 2:32 pm
A new benchmark to measure the impact of AI on improving science
- Start reading the AI Snake Oil book onlineby Arvind Narayanan on September 10, 2024 at 8:55 pm
The book will be published on September 24
- Building LLMs from the Ground Up: A 3-hour Coding Workshopby Sebastian Raschka, PhD on August 31, 2024 at 10:39 am
If your weekend plans include catching up on AI developments and understanding Large Language Models (LLMs), I've prepared a 1-hour presentation on the development cycle of LLMs, covering everything from architectural implementation to the finetuning stages.
- AI companies are pivoting from creating gods to building products. Good.by Arvind Narayanan on August 19, 2024 at 8:57 pm
Turning models into products runs into five challenges
- New LLM Pre-training and Post-training Paradigmsby Sebastian Raschka, PhD on August 17, 2024 at 11:55 am
A Look at How Moderns LLMs Are Trained
- AI existential risk probabilities are too unreliable to inform policyby Arvind Narayanan on July 26, 2024 at 11:29 am
How speculation gets laundered through pseudo-quantification
- Instruction Pretraining LLMsby Sebastian Raschka, PhD on July 20, 2024 at 11:11 am
The Latest Research in Instruction Finetuning
- New paper: AI agents that matterby Sayash Kapoor on July 3, 2024 at 4:00 pm
Rethinking AI agent benchmarking and evaluation
- AI scaling mythsby Arvind Narayanan on June 27, 2024 at 6:16 pm
Scaling will run out. The question is when.
- Developing an LLM: Building, Training, Finetuningby Sebastian Raschka, PhD on June 8, 2024 at 1:04 pm
A Deep Dive into the Lifecycle of LLM Development
- Scientists should use AI as a tool, not an oracleby Arvind Narayanan on June 3, 2024 at 6:34 pm
How AI hype leads to flawed research that fuels more hype
- LLM Research Insights: Instruction Masking and New LoRA Finetuning Experimentsby Sebastian Raschka, PhD on June 2, 2024 at 11:03 am
Discussing the Latest Model Releases and AI Research in May 2024
- How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?by Sebastian Raschka, PhD on May 12, 2024 at 11:02 am
Discussing the Latest Model Releases and AI Research in April 2024
- AI leaderboards are no longer useful. It's time to switch to Pareto curves.by Sayash Kapoor on April 30, 2024 at 2:03 pm
What spending $2,000 can tell us about evaluating AI agents
- AI Snake Oil is now available to preorderby Arvind Narayanan on April 10, 2024 at 12:55 pm
What artificial intelligence can do, what it can't, and how to tell the difference
- Tech policy is only frustrating 90% of the timeby Arvind Narayanan on April 3, 2024 at 6:00 pm
That’s what makes it worthwhile
- AI safety is not a model propertyby Arvind Narayanan on March 12, 2024 at 6:27 pm
Trying to make an AI model that can’t be misused is like trying to make a computer that can’t be used for bad things
- A safe harbor for AI evaluation and red teamingby Sayash Kapoor on March 5, 2024 at 3:55 pm
An argument for legal and technical safe harbors for AI safety and trustworthiness research
- On the Societal Impact of Open Foundation Modelsby Sayash Kapoor on February 27, 2024 at 2:00 pm
Adding precision to the debate on openness in AI