GPT-5.4 beats humans at desktop tasks

PLUS: Netflix acquires Affleck's AI startup & Meta Ray-Ban privacy scandal. Anthropic maps AI job risks, Google upgrades visual search.

In today’s agenda:

1️⃣ OpenAI's GPT-5.4 scores 75% on desktop navigation—above human baseline of 72.4%

2️⃣ Netflix acquires Ben Affleck's InterPositive to automate post-production workflows

3️⃣ Meta's Ray-Ban AI glasses send intimate footage including nudity and bank cards to human reviewers in Kenya

  • Anthropic releases AI job displacement framework: hiring of young workers in exposed fields fell 14%

  • Google upgrades Circle to Search and Lens with multi-object visual search powered by Gemini's "fan-out" technique

H-FARM EVENTS ON AI

H-FARM CLUB Artificial Intelligence

H-FARM's AI Club is a monthly community meetup that brings up to 500 people together at our beautiful H-FARM Campus to cut through the AI noise with real news, real cases, and real debate. Join us on March 12 from 6:00 to 8:00 PM. On stage this edition:

  • Latest news & updates by Diego Pizzocaro (CEO, H-FARM AI): A practical, up-to-date selection of news, newly released or updated tools, and best practices useful across industries.

  • Automating Workflows with AI by Emilio Turco (Senior Account Executive, n8n): How is AI automation evolving? This talk covers agentic workflows, human-in-the-loop systems, and reliability at scale, with real-world use cases, lessons learned, and best practices around governance, security, and evaluation that show how the most advanced companies are building cutting-edge solutions today.

  • More speakers to come.

MAIN AI UPDATES / 6th March 2026

🧠 GPT-5.4 beats humans at desktop tasks 🧠
New reasoning model matches or beats professionals in 83% of job evaluations

OpenAI released GPT-5.4, its most powerful model yet, scoring 75% on the OSWorld-Verified desktop navigation benchmark, above the human baseline of 72.4%. The model delivers major upgrades in coding, reasoning, science, and math, with support for up to 1 million tokens of context and a new extreme reasoning mode for multi-hour tasks. On the GDPval benchmark, GPT-5.4 won or tied against professionals 83% of the time across 44 job categories, up from 71% for GPT-5.2.

🎬 Netflix acquires Affleck's AI startup 🎬
InterPositive team joins to automate post-production workflows

Netflix acquired InterPositive, the AI filmmaking company Ben Affleck founded in 2022, bringing all 16 employees and Affleck himself aboard as senior adviser. The startup's technology trains models on a production's own footage to handle relighting, background swaps, and continuity fixes. Affleck emphasized the tech "is not generating video from nothing" but learning from existing filmed shots and actors. The Oscar winner previously stated he "can't stand" what AI writes but sees massive potential for production workflow improvements.

👓 Meta Ray-Ban privacy scandal 👓
Smart glasses send intimate footage to human reviewers

A joint investigation revealed that Meta's Ray-Ban AI glasses are routing highly sensitive user footage—including nudity, bathroom visits, and bank card details—to human data annotators employed by subcontractor Sama in Nairobi, Kenya. Reports show that the glasses often keep recording after being set down and that face-blurring tools don't always work as intended. At least one class action lawsuit has already been filed, accusing Meta of false advertising and privacy violations.

INTERESTING TO KNOW

📊 Anthropic maps AI job risks 📊

Anthropic released a study measuring AI's labor market impact through "observed exposure"—comparing tasks AI can do against what people actually use Claude for. Computer programmers top the list at 75% task coverage, followed by customer service reps and data entry at 67%. While no broad unemployment spike has appeared since ChatGPT's 2022 launch, hiring for exposed fields among 22-to-25-year-olds fell 14% in that timeframe.

🔍 Google upgrades visual search 🔍

Google rolled out multi-object visual search for Circle to Search and Lens, allowing users to identify and search for multiple items within a single image simultaneously. Powered by Gemini's multimodal capabilities, the system uses a "fan-out" technique that triggers multiple searches at once and returns one cohesive response. Users can now search an entire outfit, room decor, or garden scene and get results for every component in seconds.
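
For the curious, the general fan-out idea can be sketched in a few lines of Python. This is only an illustration of the pattern (one query per detected item, run in parallel, merged into one answer) and not Google's implementation; `search_item` and `fan_out_search` are hypothetical names, and the search call is a stub that returns placeholder text.

```python
from concurrent.futures import ThreadPoolExecutor


def search_item(item: str) -> str:
    """Hypothetical stand-in for a single search request (no real API is called)."""
    return f"results for {item}"


def fan_out_search(items: list[str]) -> str:
    """Run one search per detected item in parallel, then merge the answers."""
    with ThreadPoolExecutor(max_workers=max(1, len(items))) as pool:
        # map() returns results in input order even though calls run concurrently
        results = list(pool.map(search_item, items))
    # Combine the per-item results into a single response
    return "; ".join(results)


print(fan_out_search(["jacket", "sneakers", "handbag"]))
```

In a real system the merge step would likely be a model call that writes one summary, rather than a simple string join.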

📩 Have questions or feedback? Just reply to this email. We'd love to hear from you!
