Tech & AI:
Spring 2026
💻
10 March – 5 April 2026
Period27 days of reporting
Key model releasesGPT-5.4 mini, Gemma 4, Qwen 3.5/3.6, Nemotron Cascade 2
Top fundingOpenAI $122B round; Replit $400M at $9B
Major layoffsAtlassian 1,600; Dell 11,000; Oracle 20,000+; Block ~40%
Security incidentsMcKinsey breach (47M messages); litellm supply chain; Claude/Codex source leaks
Breakout projectHermes Agent: 0 → 24,000 GitHub stars in 22 days

Tech & AI: Spring 2026

Contents
  1. Summary
  2. AI model releases & benchmarks
  3. AI agents & coding tools
  4. Open-source momentum
  5. Hardware & infrastructure
  6. Cybersecurity incidents
  7. Corporate moves & funding
  8. Layoffs & workforce shifts
  9. Developer tools & platforms
  10. Drone & defence tech
  11. AI in science & medicine
  12. Local inference revolution
  13. Policy & regulation
  14. Emerging trends
  15. Condensed timeline
  16. Sources

Summary

The period from 10 March to 5 April 2026 was defined by three converging dynamics in the technology sector: the rapid maturation of open-source AI models approaching parity with closed frontier systems, an explosion in autonomous AI agent frameworks reshaping software development, and a series of major security incidents exposing the fragility of the AI supply chain.

On the model front, Google DeepMind’s Gemma 4 family (up to 31B dense) surpassed far larger models on ELO rankings, while Alibaba’s Qwen 3.5/3.6 series achieved near-parity with Anthropic’s Opus 4.5 at a fraction of the cost. OpenAI’s GPT-5.4 processed 5 trillion tokens per day—exceeding the entire API’s yearly volume from 2025—and generated $1 billion in annualised net-new revenue, while the GPT-5.4 mini delivered a 32x efficiency gain ($11.84/task down to $0.37).

The period’s breakout project was Hermes Agent by NousResearch, which rocketed from launch to 24,000 GitHub stars in 22 days, becoming the third fastest-growing repo on GitHub and the fifth most-used AI agent globally. The broader AI agent ecosystem—including Cursor, Claude Code, OpenAI Codex, and the new Hermes Agent—saw rapid competitive leapfrogging with no defensible positions emerging.

Security dominated headlines: McKinsey’s internal AI chatbot “Lilli” was breached, exposing 47 million chat messages and 728,000 client records via trivial SQL injection. A supply chain attack on the litellm PyPI package (97 million monthly downloads) exfiltrated SSH keys, cloud credentials, and crypto wallets—discovered only because the attacker’s memory leak bug caused a system crash. Both Claude Code and OpenAI Codex had their source code leaked within the same 24-hour window.

Financially, OpenAI closed a landmark $122 billion funding round, while the industry shed over 30,000 jobs across Atlassian, Dell, Oracle, Block, and rumoured Meta cuts—all explicitly linked to AI-driven restructuring.

AI model releases & benchmarks

Frontier and near-frontier models

ModelCompanyDateKey specs
GPT-5.4OpenAI~16 Mar5T tokens/day; $1B annualised net-new ARR
GPT-5.4 miniOpenAI17 Mar32x efficiency over GPT-5.2; $0.37/task (from $11.84)
Gemma 4 (31B Dense)Google DeepMind2 AprSurpasses Qwen 122B and DeepSeek v3.2 on ELO
Gemma 4 (26B MoE, 4B active)Google DeepMind2 AprParity with Kimi K2.5 (1.1T params) at 35x smaller; runs on mobile
Qwen 3.5 27B DenseAlibaba26 Mar256K context; fits on RTX 5090; “Claude Sonnet 4.6 quality at home”
Qwen 3.6-PlusAlibaba31 MarNear-parity with Opus 4.5; wins Terminal-Bench 2.0; tied on SWE-bench
GLM-5.1Zhipu21 MarOpen-source; long-horizon agentic engineering (week-scale tasks)
MiniMax M2.7MiniMax31 MarOpen-sourced; “closest to fully local Claude Code + Opus 4.6”
Nemotron 3 Super 120BNVIDIA11 MarMoE (12B active); 1M context; open weights/datasets
Nemotron Cascade 2NVIDIA25 MarMamba-2 arch; 187 tok/s on RTX 3090; flat perf 4K–625K context
Trinity-Large-Thinking2 Apr#2 PinchBench; Apache 2.0; ~96% cheaper than Opus

Specialised models

Performance degradation

All major frontier models—Opus 4.6, ChatGPT 5.4, and Codex 5.3—experienced significant performance degradation during this period. Flutter development was particularly affected, with non-compiling tests and false passing claims. Claude Opus 4.6 was reported entering circular reasoning loops, while ChatGPT 5.4 created overly complex solutions for simple problems.

AI agents & coding tools

Hermes Agent (NousResearch)

The breakout project of the period. Launched to 24,000+ GitHub stars in 22 days, making it the #3 fastest-growing GitHub repo and the 5th most-used AI agent globally:

Coding assistant landscape

The AI coding tool market saw rapid competitive leapfrogging with no defensible positions:

Autonomous development paradigm

Open-source momentum

Open-source AI models approached parity with closed frontier systems during this period:

Key tension: Anthropic criticised for restrictive licensing on skills (vs. Apache 2.0 for most Codex skills) and for blocking third-party access through OpenClaw. Users migrating to Codex, Droid, Kimi CLI, and OpenCode alternatives.

Hardware & infrastructure

NVIDIA GTC highlights

Consumer hardware

Infrastructure projects

Quantisation breakthroughs

Cybersecurity incidents

McKinsey “Lilli” chatbot breach (13 Mar)

Internal chatbot trained on 100,000 documents, used by 70% of 45,000 employees (500,000 monthly prompts). Exposed: 47 million chat messages (strategy/M&A data), 728,000 confidential client records, 57,000 user accounts, 95 system prompts. Root cause: publicly exposed API documentation with 22 unauthenticated endpoints vulnerable to SQL injection. Patched after discovery.

litellm PyPI supply chain attack (24 Mar)

Versions ≥1.64.0 exfiltrated SSH keys, AWS/GCP/Azure credentials, Kubernetes configs, git credentials, API keys, shell history, crypto wallets, SSL private keys, CI/CD secrets, and database passwords. The package has 97 million monthly downloads, and transitive dependencies (notably DSPy) were also compromised. The attack was discovered only because the attacker’s memory leak bug caused a system crash. Karpathy called supply chain attacks “the scariest thing imaginable in modern software.”

Source code leaks (31 Mar)

Both Anthropic’s Claude Code and OpenAI’s Codex had their source code exposed within the same 24-hour window. Claude Code was leaked via sourcemaps (~512,000 lines); thousands of forks appeared within hours. Anthropic sent DMCA takedown notices—ironic given it had issued similar takedowns against open-source forks 336 days earlier.

Other incidents

Corporate moves & funding

Major funding

CompanyAmountNotes
OpenAI$122 billionLargest AI funding round ever (1 Apr). Offering PE firms 17.5% guaranteed minimum return + early model access.
Replit$400M at $9BLaunched Replit Agent 4 (11 Mar)
Yann LeCun’s lab$1.03 billionNew AI research lab with global mission (10 Mar)
OpenAI Foundation$1B first-year spendFocus: biological threats, economic disruption, societal effects (24 Mar)

Acquisitions

Service shutdowns

Layoffs & workforce shifts

Over 30,000 jobs were shed in a four-week period, all explicitly linked to AI-driven restructuring:

CompanyCutsDateNotes
Oracle20,000+1 Apr
Meta~15,000 (rumoured)14 Mar20% reduction to fund AI investments
Dell11,00019 Mar
Atlassian1,600 (10%)12 MarCited AI impact on business
Block~40% headcount20 MarFramed as AI automation

PwC benchmarked OpenAI’s finance team at 20% of typical company size. Jensen Huang stated engineers earning $500k should consume $250k+ in AI tokens. Developers in the Polish IT industry reported AI tools increasing workload due to layoff fears rather than improving quality of life.

The code review crisis was highlighted by Gene Kim and Jez Humble: AI-generated code volume is overwhelming traditional review processes, raising questions about quality assurance, accountability, and career development.

Developer tools & platforms

Drone & defence tech

Ukraine’s drone technology became a major export and diplomatic asset (see also Russo-Ukrainian War: Spring 2026):

AI in science & medicine

Local inference revolution

A recurring theme was the surprising competitiveness of local inference on consumer hardware versus expensive enterprise infrastructure:

The dense vs. MoE surprise

Qwen 3.5 27B dense running on a single $900 RTX 3090 repeatedly outperformed 120B+ MoE models on $70K enterprise nodes (2x H200 NVL) in real-world autonomous coding tasks. Dense models (every parameter active per token) showed unexpected strength in agent-based coding where architectural coherence matters more than raw parameter count.

Recommended inference engines (by use case)

Key benchmarks

VRAM landscape

Community survey: 8–12GB (31.6%), 24GB (34.5%), 48GB+ (21.5%). Memory bandwidth (not VRAM size) identified as the true local LLM bottleneck.

Policy & regulation

AI adoption metrics

Developer sentiment

Platform dynamics

Upcoming models (2026 watch list)

MiniMax M3 (multimodal), NVIDIA Nemotron 3 Ultra (~500B parameters), Kimi K3, DeepSeek V4, GLM-5.1 full release, GPT-Image-2 (leaked on arena leaderboards under codenames maskingtape-alpha, gaffertape-alpha, packingtape-alpha).

Condensed timeline

DateKey events
10 MarKarpathy’s Autoresearch: 11% efficiency gain via 700 autonomous experiments. Yann LeCun raises $1.03B for new lab. Tinygrad modular datacenter ($3M, 20 exaflops target).
11 MarNVIDIA Nemotron 3 Super 120B (1M context). Replit: $400M at $9B. Claude Code adds scheduled automation. TADA open-source TTS.
12 MarNetflix acquires InterPositive ($600M). Atlassian lays off 1,600. Ukraine opens battlefield data for AI training. Universalis: “last generation to write code by hand.”
13 MarMcKinsey “Lilli” breach: 47M messages, 728K client records via SQL injection. U.S. deploys 10,000 Ukrainian Merops drones. AI adoption at 34% “early majority.”
14 MarOllama upgrades to NVIDIA B300. Personalised mRNA cancer vaccine ($3K, ChatGPT + AlphaFold). Chrome 146 local AI toggle. Meta rumoured 20% cuts.
15 MarApple Neo MacBook ($599, modular). Opus 4.6 / GPT-5.4 / Codex 5.3 all degrading. Open-source radar at 95% cost reduction.
16 MarGPT-5.4: 5T tokens/day, $1B net-new ARR. Niantic Spatial: 1,000 autonomous delivery robots. NVIDIA up 1,000x since 2009. Code review crisis emerges.
17 MarGPT-5.4 mini: 32x efficiency ($0.37/task). NVIDIA DLSS 5 “ChatGPT moment for graphics.” GKE replaces etcd with Spanner. Codex subagents for parallel workflows.
18 MarICML 2026: 497 papers desk-rejected for AI reviewer use. Nemotron 3 Nano 4B for edge. Shahed drone flies 5 km from Polish border. Solo dev “Ben” at $4.4M ARR.
19 MarOpenAI acquires Astral (uv + ruff). Google launches Stitch (Figma competitor). Cline VS Code compromised (~4K machines). Dell lays off 11,000. Meta shuts Horizons Worlds.
20 MarCursor releases first custom model (routes through Kimi/Moonshot). Block cuts ~40%. Super Micro: $2.5B GPU smuggling. Telegram added to Claude. DoorDash films chores for robotics data.
21 MarGLM-5.1 open-sourced. Halter (AI cow collars) valued at $2B+. Patient uses ChatGPT for cancer treatment. Anthropic CEO on government dissent.
22 MarMiniMax M2.7 open weights announced. Hermes Agent: 10K stars, 8+ day continuous runs. GPU supply chain shortages. Cargill CarVe: $200M meat recovery. Terrafab rumour (Tesla/xAI/SpaceX chips).
23 MarApple M5 Pro/Max: RDMA clustering, 1T-param models at 70+ tok/s. NVIDIA Kimodo: text-to-3D motion. Cursor revealed using Chinese Kimi model. Framework price reductions.
24 Marlitellm supply chain attack: SSH keys, cloud creds exfiltrated (97M monthly downloads). Hermes Agent v0.4.0: 300 PRs in one week. Nemotron Cascade: 3B active, IMO gold-medal math. OpenAI Foundation ($1B). Lace Lithography: 10x Moore’s Law extension claim.
25 MarNemotron Cascade 2: 187 tok/s on RTX 3090. Apple Siri switching to Gemini. OpenAI Sora discontinued. TurboQuant: 6x memory reduction. Code.Storage for AI-generated repos.
26 MarQwen 3.5 27B: “release of 2026 so far.” Hermes Agent: 13.3K stars. Karpathy proposes fully autonomous DevOps agents. OpenAI erotic features shut down in 48h.
27 MarStargate Michigan: first steel beams. Cohere Transcribe (2B, 14 languages). Hermes Agent GODMODE for model jailbreaking. mRNA vaccine highlighted by Altman. Apple rumoured reconsidering SwiftUI.
28 MarGemini 3.1 Flash Live API. Qwen 3.5 27B: 35 tok/s on RTX 3090, 262K context. levelsio: 24-min MVP build. RLHF critique: optimises for confidence not accuracy.
29 MarHermes Agent v0.5.0: browser automation. Qwen 9B builds 2,699-line game on first try. Nemotron Cascade 2 fails coding coherence tests. llama.cpp > Ollama for agents.
30 Marllama.cpp reaches 100K GitHub stars. John Deere: 92% daily AI engagement. Hermes Agent: 3,000+ members. Claude Code integrates with Codex.
31 MarClaude Code + Codex source code leaked in same 24h window (~512K lines). npm axios supply chain attack (300M downloads). MiniMax M2.7 open-sourced. Transformers.js v4 (WebGPU). Qwen 3.6-Plus preview.
1 AprOpenAI closes $122B funding round. Oracle lays off 20,000+. Hermes Agent ranked 6th AI app on OpenRouter. Tinygrad Mac eGPU support. Chrome extension malware wave.
2 AprGoogle DeepMind Gemma 4 (31B surpasses Qwen 122B on ELO). Trinity-Large-Thinking: #2 PinchBench, 96% cheaper than Opus. OpenAI acquires TBPN. OpenScreen open-sourced (8.4K stars).
3 AprGemma 4 26B MoE matches 1.1T-param Kimi K2.5 at 35x smaller. Cursor 3 launches. Hermes Agent: #3 fastest-growing GitHub repo. Vibe Jam 2026 ($35K prizes). mesh-llm open-sourced by Block.
4 AprHermes Agent v0.7.0: 24K stars (from 13K in 2 days). GPT-Image-2 leaked on arena. Ollama Cloud ($20/mo). RTX 3090 ($900) outperforms $70K H200 nodes in coding. Vibe Jam: participants as young as 10.
5 AprAnthropic blocks third-party access. Gemma 4 E2B runs on iPhone 17 Pro. Block open-sources Goose agent (150+ MCP servers). PwC: OpenAI finance team at 20% of normal size.

Sources

Compiled from daily tech/AI/IT intelligence reports published at waclaw.online/raport2/tech/, covering 10 March – 5 April 2026 (27 daily reports). Individual reports available for each date.

Categories: Technology | Artificial intelligence | Cybersecurity | 2026