
Context Engineering: What the Model Is Allowed to See
The reliability of an AI system is decided less by how you word the prompt and more by what you let into the context window. Context engineering is the name for that discipline, and it is the highest-leverage AI skill for anyone doing real work, especially in regulated settings.
Read article
OpenAI Built Its Own Chip. The Real Story Is the Cost of Intelligence.
On 24 June, OpenAI and Broadcom unveiled Jalapeño, OpenAI's first custom chip, built to run its models more cheaply. The coverage is about a strike at Nvidia. The useful signal for a practitioner is the opposite of hardware: it is the falling cost of intelligence, and what that does to your AI decisions and your governance. Cost has quietly been holding AI back. That fence is coming down.
Read article
Your AI Assistant Just Became a Shared Teammate. Govern the Channel.
On 23 June, Anthropic launched Claude Tag, a single shared Claude that lives in a Slack workspace with its own memory and admin-scoped access to channels, tools and data. The unit of AI collaboration just moved from the private conversation to the team channel. The thing you now have to govern is no longer a prompt. It is a standing presence. Here is what changes, and the three decisions to make before it is live.
Read article
Model Context Protocol: The Standard Wiring AI Into Your Tools
In eighteen months the Model Context Protocol went from an Anthropic experiment to the way AI plugs into your tools and data. Understanding what it is matters less than governing the connectors, because each one is a new door into your systems. Here is the capability and the control work.
Read article
The Strongest Open Model Is Now Chinese. Mind Where Your Data Goes.
On 16 June, China's Z.ai released GLM-5.2 under an MIT licence with no regional limits, the highest-ranked open-weights model on its own coding benchmarks. With Anthropic's Fable 5 pulled by a US directive, the strongest model you can simply download and run is now Chinese. The decision that carries your risk is not the model. It is whether you run the open weights yourself or send your data to the hosted API.
Read article
ChatGPT Just Got Better at Health. Mind the Boundary.
On 18 June OpenAI announced a substantial step up in ChatGPT's health intelligence, free to the 230 million people who already ask it health questions every week. Better answers do not move the boundary between information and a clinical decision. Here is what that means for Australian professionals this week.
Read article
Stop Trusting the Leaderboard: Evaluate AI on Your Own Work
A new frontier model lands most months, the public benchmarks vendors tout are methodologically shaky, and the demo always wins. The only evidence that should move your money is performance on your own work. Here is how to test it, using a private evaluation Project you build yourself.
Read article
Business Teams Can Now Build Their Own AI Agents
Databricks launched Genie One this week, an agentic coworker pitched at finance and marketing teams, not engineers. The real shift is who holds the build button, and where that moves the control point. Here is what to do this week, with a governance prompt you can run today.
Read article
The OWASP Agentic Top 10: A Defence Playbook for the Agents You Are Deploying
OWASP has published a Top 10 built specifically for AI agents. It reframes the agent as a privileged user that reads untrusted text and acts with your access. Here is the practical defence playbook.
Read article
AI Is Moving Into the Core Systems of Regulated Work
This week two of the world's largest IT services firms began wiring a frontier model into the core systems that banks, insurers and airlines run on, not the chat window. Here is what it means for regulated work, and what to do this week.
Read article
AI Week in Review, 8-14 June 2026: A Frontier Model Pulled by Government Order
The week a US directive forced Anthropic to suspend two new frontier models worldwide, plus six verified vendor moves and a repeatable method for turning AI news into Monday actions.
Read article
Claude Fable 5: Frontier Capability, With Conditions Attached
Anthropic has put a Mythos-class model on general release, and the conditions matter as much as the capability. A silent classifier fallback, a mandatory 30-day retention policy and a 23 June billing switch all belong in your next third-party AI assessment.
Read article
Gemini 3.5 Flash: Google Makes the Agent the Default
Google made an agentic model the worldwide default in the Gemini app and AI Mode in Search before the flagship even shipped. The benchmark and pricing evidence says Flash genuinely replaces last generation's Pro, and millions of workers got a more autonomous default model overnight.
Read article
Microsoft's Seven MAI Models: The In-House Bet Under Copilot
Microsoft launched seven home-grown MAI models at Build 2026 and started swapping them into Copilot and the Microsoft 365 stack. For practitioners the story is procurement, not benchmarks: data lineage claims, weight tuning, and a billing change in the same week.
Read article
AI Agents Need Approval Gates Before They Need Autonomy
Autonomous AI agents are becoming practical, but organisations should design approval gates, permissions and evidence trails before granting action rights.
Read article
AI Upskilling Will Fail If HR Does Not Redesign the Work
Training people to use AI is useful, but HR also needs to redesign roles, capability frameworks and quality controls around changed work.
Read article
Small Models, Edge AI and the Next Governance Blind Spot
As AI moves into devices, business apps and smaller specialised models, organisations need governance that looks beyond frontier models and public chatbots.
Read article
Workplace AI and Privacy: The Trust Test HR Cannot Outsource
AI productivity tools can reshape workplace data collection, monitoring and employee trust. HR needs a privacy-first governance model before adoption scales.
Read article
The AI Pilot-to-Scale Gap Is an Operating Model Problem
Most organisations can run AI pilots. Far fewer can scale them safely, consistently and usefully across real work.
Read article
AI in Hiring Needs Human Review Before It Needs Another Tool
Australian HR teams can use AI in recruitment, but hiring workflows need privacy discipline, bias checks, candidate transparency and accountable human judgement.
Read article
Agentic Browsing: What Actually Shipped This Month
Three agentic browsing platforms shipped meaningful updates in April. The demos are convincing. The production reliability is not. Where these agents work, where they fail, and what to do this quarter.
Read article
On-Device AI at Work: Apple Intelligence and Pixel Gemini Nano
On-device AI is enterprise-ready in narrow ways and not in the ways the demos suggest. Apple Intelligence and Pixel Gemini Nano in April 2026: what works, what does not, and the real privacy story.
Read article
2M-Token Multimodal Contexts: Where They Actually Pay Off
Two-million-token multimodal context is real. The marketing says it replaces RAG. The production data says it does not. Three workflows where it pays off and three where it does not.
Read article
The Open-Source Frontier in April 2026: Llama 4, DeepSeek R2, Mistral Sovereign
Three serious open-weight contenders shipped in April 2026. None of them is the right answer for every workload, but each has carved out a defensible enterprise niche. Here is the comparison.
Read article
Australian AI Safety Standard: 18-Month Review
Eighteen months in, Australia's voluntary AI Safety Standard has shifted from optional reading to procurement table stakes. Three things worked. Two did not. The next phase is a shift towards mandatory rules.
Read article
Reasoning Budgets in Production: How Teams Are Spending Them
Anthropic shipped reasoning budgets in late March. Six weeks of production data shows the feature pays for itself when teams set the right ceilings. It does not when they leave it on default.
Read article
GPT-5 in the Enterprise: 60-Day Debrief
Sixty days after GPT-5 hit enterprise GA, the tool-use story is real and the pricing story is messier. Three patterns separate the teams getting value from the teams burning credits.
Read article
News Posts
Rapid updates when AI news breaks
Concise updates of 150-200 words: no filler, straight to the signal. Available on the site and cross-posted to X (@TheAICommand).
Build 2026 makes agents first-class citizens of the Microsoft stack
OAIC survey finds trust in AI companies has collapsed to 4 per cent
Mistral puts frontier-class weights on four GPUs with Medium 3.5
ASIC research maps AI spreading through underwriting and claims
Google I/O 2026: agents get desktops, sandboxes and enterprise plumbing
ASIC demands urgent cyber uplift as frontier AI raises the threat level
Interactive tools



