AI Digest

Sourced summaries of AI / ML news and scientific publications, generated automatically every night from a curated set of RSS feeds.

News 2026-06-22

Models & Benchmarks

Enterprise & Deployment

Tools & Infrastructure

Developer Notes

Ecosystem & Governance

Sources
  [1] GLM 5.2 vs. Opus (hnrss.org, 2026-06-22)
  [2] Import AI 462: Superpersuasion; self-sustaining AI; paths to ASI (jack-clark.net, 2026-06-22)
  [3] PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters (huggingface.co, 2026-06-22)
  [4] Samsung Electronics brings ChatGPT and Codex to employees (openai.com, 2026-06-22)
  [5] sqlite-utils 4.0rc1 adds migrations and nested transactions (simonwillison.net, 2026-06-22)
  [6] sqlite-utils 4.0rc1 (simonwillison.net, 2026-06-22)
  [7] Deno Desktop (hnrss.org, 2026-06-22)
  [8] Temporary Cloudflare Accounts for AI agents (simonwillison.net, 2026-06-22)
  [9] Codex logging bug may write TBs to local SSDs (hnrss.org, 2026-06-22)
  [10] Sakana Fugu (hnrss.org, 2026-06-22)
  [11] Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions (hnrss.org, 2026-06-22)
  [12] Claude Code's "extended thinking" is a summary- not authentic thinking (hnrss.org, 2026-06-22)
  [13] Measuring What Matters with Jules (google ai, 2026-06-22)
  [14] The semantic debt crisis no one is talking about (dbt.com, 2026-06-22)
  [15] Pledging Another $400k to the Zig Software Foundation (hnrss.org, 2026-06-22)

Papers 2026-06-18

Agentic Systems and Tool Use

Multimodal and Vision-Language Models

Efficiency and Serving

Safety, Alignment, and Evaluation

Speech and Audio

Datasets and Benchmarks

Theory and Foundations

Sources
  [1] LedgerAgent: Structured State for Policy-Adherent Tool-Calling Agents (arxiv cs.CL, 2026-06-18)
  [2] Beyond Global Replanning: Hierarchical Recovery for Cross-Device Agent Systems (arxiv cs.CL, 2026-06-18)
  [3] Sovereign Execution Brokers: Enforcing Certificate-Bound Authority in Agentic Control Planes (arxiv cs.AI, 2026-06-18)
  [4] When Does Streaming Tool Use Help? Characterizing Tool-Intent Stabilization in Streaming Retrieval-Augmented Generation (arxiv cs.CL, 2026-06-18)
  [5] StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMs (arxiv cs.CL, 2026-06-18)
  [6] UNIEGO: Proxies as Mediators for Unified Egocentric Video Representation Learning (arxiv cs.LG, 2026-06-18)
  [7] Scalable Training of Spatially Grounded 2D Vision-Language Models for Radiology (arxiv cs.CL, 2026-06-18)
  [8] SARLO-80: Worldwide Slant SAR Language Optic Dataset 80cm (arxiv cs.AI, 2026-06-18)
  [9] NAMESAKES: Probing Identity Memorization in Text-to-Image Models (arxiv cs.CL, 2026-06-18)
  [10] UltraQuant: 4-bit KV Caching for Context-Heavy Agents (arxiv cs.AI, 2026-06-18)
  [11] Execution-State Capsules: Graph-Bound Execution-State Checkpoint and Restore for Low-Latency, Small-Batch, On-Device Physical-AI Serving (arxiv cs.LG, 2026-06-18)
  [12] Structuring and Tokenizing Distributed User Interest Context for Generative Recommendation (arxiv cs.AI, 2026-06-18)
  [13] Contagion Networks: Evaluator Bias Propagation in Multi-Agent LLM Systems (arxiv cs.AI, 2026-06-18)
  [14] Analyzing Defensive Misdirection Against Model-Guided Automated Attacks on Agentic AI Systems (arxiv cs.AI, 2026-06-18)
  [15] Actionable Activation Directions for Detecting and Mitigating Emergent Misalignment Across Language Model Families (arxiv cs.CL, 2026-06-18)
  [16] Calibration Without Comprehension: Diagnosing the Limits of Fine-Tuning LLMs for Vulnerability Detection in Systems Software (arxiv cs.AI, 2026-06-18)
  [17] Apparent Psychological Profiles of Large Language Models are Largely a Measurement Artifact (arxiv cs.CL, 2026-06-18)
  [18] FlowEdit: Associative Memory for Lifelong Pronunciation Adaptation in Flow-Matching TTS (arxiv cs.AI, 2026-06-18)
  [19] PASQA: Pitch-Accent-Focused Speech Quality Assessment Model Trained on Synthetic Speech with Accent Errors (arxiv cs.CL, 2026-06-18)
  [20] Repurposing a Speech Classifier for Guided Diffusion-Based Speech Generation (arxiv cs.AI, 2026-06-18)
  [21] CATCH-ME if you RAG: a dataset of Contextually Annotated multi-Turn Counterspeech against Hate and Misinformation Exchanges (arxiv cs.CL, 2026-06-18)
  [22] Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages (arxiv cs.AI, 2026-06-18)
  [23] CzechDocs: A Multiway Parallel Dataset of Formatted Documents for Minority Languages in Czechia (arxiv cs.CL, 2026-06-18)
  [24] Optimal Deterministic Multicalibration and Omniprediction (arxiv cs.LG, 2026-06-18)
  [25] Fisher-Geometric Sharpness and the Implicit Bias of SGD toward Flat Minima (arxiv cs.LG, 2026-06-18)