AI Daily | 2026-05-15

147 items collected today

📰 Industry News

⭐️⭐️ The creator of Claude Code just revealed

When the creator of the world's most advanced coding agent speaks, Silicon Valley doesn't just listen — it takes notes. For the past week, the engineering community has been dissecting a thread on X from Boris Cherny, t

  • Related: The, Claude, Code
  • Tags: news, VentureBeat AI
  • 📎 Original link

⭐️⭐️ Nous Research's NousCoder-14B is an open

Nous Research, the open-source artificial intelligence startup backed by crypto venture firm Paradigm, released a new competitive programming model on Monday that it says matches or exceeds several larger proprietary s

  • Related: Nous, Research's, NousCoder-14B, Claude, Code
  • Tags: news, VentureBeat AI
  • 📎 Original link

⭐️⭐️ Anthropic launches Cowork, a Claude Desk

Anthropic released Cowork on Monday, a new AI agent capability that extends the power of its wildly successful Claude Code tool to non-technical users — and according to company insiders, the team built the entire featur

  • Related: Anthropic, Cowork, Claude, Desktop
  • Tags: news, VentureBeat AI
  • 📎 Original link

⭐️⭐️ Salesforce rolls out new Slackbot AI age

Salesforce on Tuesday launched an entirely rebuilt version of Slackbot, the company's workplace assistant, transforming it from a simple notification tool into what executives describe as a fully powered AI agent capabl

  • Related: Salesforce, Slackbot, AI, Microsoft, Google
  • Tags: news, VentureBeat AI
  • 📎 Original link

⭐️⭐️ Listen Labs raises $69M after viral bill

Alfred Wahlforss was running out of options. His startup, Listen Labs, needed to hire over 100 engineers, but competing against Mark Zuckerberg's $100 million offers seemed impossible. So he spent $5,000 — a fifth of hi

  • Related: Listen, Labs, AI
  • Tags: news, VentureBeat AI
  • 📎 Original link

⭐️⭐️ Claude Code costs up to $200 a month. Go

The artificial intelligence coding revolution comes with a catch: it's expensive. Claude Code, Anthropic's terminal-based AI agent that can write, debug, and deploy code autonomously, has captured the imagination of sof

  • Related: Claude, Code, Goose
  • Tags: news, VentureBeat AI
  • 📎 Original link

⭐️⭐️ Railway secures $100 million to challeng

Railway, a San Francisco-based cloud platform that has quietly amassed two million developers without spending a dollar on marketing, announced Thursday that it raised $100 million in a Series B funding round, as surgin

  • Related: Railway, AWS, AI-native
  • Tags: news, VentureBeat AI
  • 📎 Original link

⭐️⭐️ Musk and Altman face off in trial that w

Musk’s shifting stance on AI dangers may complicate trial over OpenAI’s mission.

  • Related: Musk, Altman, OpenAI's
  • Tags: news, Ars Technica AI
  • 📎 Original link

⭐️⭐️ The hidden cost of Google's AI defaults

Google says it respects user privacy in AI, but the reality is not so black and white.

  • Related: The, Google's, AI
  • Tags: news, Ars Technica AI
  • 📎 Original link

⭐️⭐️ A blueprint for using AI to strengthen d

Every few centuries, changes in how information moves reshape how societies govern themselves. The printing press spread vernacular literacy, helping give rise to the Reformation and, eventually, representative governmen

  • Related: A, AI
  • Tags: news, MIT Tech Review AI
  • 📎 Original link

⭐️⭐️ Google's Gemma 4 AI models get 3x speed

Up to 3x the speed with no loss of quality—is it too good to be true?

  • Related: Google's, Gemma, AI
  • Tags: news, Ars Technica AI
  • 📎 Original link

⭐️⭐️ Google unveils screenless Fitbit Air and

The $100 Fitbit Air is available for preorder today.

  • Related: Google, Fitbit, Air, Google, Health
  • Tags: news, Ars Technica AI
  • 📎 Original link

⭐️⭐️ Chrome's 4GB AI model isn't new, but you

You can stop Chrome from taking up 4GB of storage for local AI, but that shouldn't be your problem.

  • Related: Chrome's, 4GB, AI
  • Tags: news, Ars Technica AI
  • 📎 Original link

Google's AI search will start citing its sources in several new ways.

  • Related: Course, Google, AI, Overviews
  • Tags: news, Ars Technica AI
  • 📎 Original link

⭐️⭐️ Musk v. Altman week 2: OpenAI fires back

In the second week of the landmark trial between Elon Musk and OpenAI, Musk’s motivations for bringing the suit were under scrutiny. Last week, Musk took the stand, alleging that OpenAI CEO Sam Altman and president Greg

  • Related: Musk, Altman, OpenAI, Shivon, Zilis
  • Tags: news, MIT Tech Review AI
  • 📎 Original link

⭐️⭐️ Implementing advanced AI technologies in

In finance departments that have long been defined by precision and control, AI has arrived less as a neatly managed upgrade than as a quiet insurgency. Employees are already using it while leadership races to impose str

  • Related: Implementing, AI
  • Tags: news, MIT Tech Review AI
  • 📎 Original link

⭐️⭐️ Fostering breakthrough AI innovation thr

Despite years of digitization, organizations capture less than one-third of the value expected from digital investments, according to McKinsey research. That’s because most big companies begin with technological capabili

  • Related: Fostering, AI
  • Tags: news, MIT Tech Review AI
  • 📎 Original link

⭐️⭐️ Three things in AI to watch, according t

This story originally appeared in The Algorithm, our weekly newsletter on AI. A few months before he was awarded the Nobel Prize in economics in 2024, Daron Ace

  • Related: Three, AI, Nobel-winning
  • Tags: news, MIT Tech Review AI
  • 📎 Original link

⭐️⭐️ Data center guzzled 30 million gallons o

Can AI save us from the AI industry’s endless thirst for water? Outlook not so good.

⭐️⭐️ Android is getting a big AI overhaul in

Google has big plans for Android in 2026, and most of it is AI.

  • Related: Android, AI
  • Tags: news, Ars Technica AI
  • 📎 Original link

⭐️⭐️ Altman forced to confront claims at Open

"Very painful": Altman relives his Muskian reaction to losing control over OpenAI.

  • Related: Altman, OpenAI
  • Tags: news, Ars Technica AI
  • 📎 Original link

⭐️⭐️ AI chatbots are giving out people’s real

A Redditor recently wrote that he was “desperate for help”: for about a month, he said, his phone had been inundated by calls from “strangers” who were “looking for a lawyer, a product designer, a locksmith.” Callers wer

⭐️⭐️ The shock of seeing your body used in de

When Jennifer got a job doing research for a nonprofit in 2023, she ran her new professional headshot through a facial recognition program. She wanted to see if the tech would pull up the porn videos she’d made more than

  • Related: The
  • Tags: news, MIT Tech Review AI
  • 📎 Original link

⭐️⭐️ Desperate Trump taps "Tim Apple," Jensen

Xi meeting may force Trump to pivot on chip restrictions and Taiwan.

  • Related: Desperate, Trump, "Tim, Apple,", Jensen
  • Tags: news, Ars Technica AI
  • 📎 Original link

⭐️⭐️ Data readiness for agentic AI in financi

Financial services companies have unique needs when it comes to business AI. They operate in one of the most highly regulated sectors while responding to external events that are updated by the second. As a result, the s

  • Related: Data, AI
  • Tags: news, MIT Tech Review AI
  • 📎 Original link

⭐️⭐️ Establishing AI and data sovereignty in

When generative AI first moved from research labs into real-world business applications, enterprises made a tacit bargain: “Capability now, control later.” Feed your proprietary data into third-party AI models, and you w

  • Related: Establishing, AI
  • Tags: news, MIT Tech Review AI
  • 📎 Original link

⭐️⭐️ Cerebras raises $5.5B, then stock pops $

A year ago, it looked like this day would never happen for Cerebras.

  • Related: Cerebras, IPO
  • Tags: news, TechCrunch AI
  • 📎 Original link

⭐️⭐️ Microsoft starts canceling Claude Code l

Microsoft first started opening up access to Claude Code in December, inviting thousands of its own developers to use Anthropic's AI coding tool daily. It was part of an effort to get project managers, designers, and oth

  • Related: Microsoft, Claude, Code
  • Tags: news, The Verge AI
  • 📎 Original link

⭐️⭐️ Clawdmeter turns your Claude Code usage

A new open source gadget called Clawdmeter turns Claude Code usage stats into a tiny desktop dashboard for AI coding power users.

  • Related: Clawdmeter, Claude, Code
  • Tags: news, TechCrunch AI
  • 📎 Original link

OpenAI is so frustrated with Apple over a ChatGPT integration that failed to deliver the subscribers and prominence it expected that the company is now actively exploring legal action against the iPhone maker.

  • Related: OpenAI, Apple
  • Tags: news, TechCrunch AI
  • 📎 Original link

⭐️⭐️ What happens when AI starts building its

Richard Socher's new $650 million startup wants to build an AI that can research and improve itself indefinitely — and he insists it will actually ship products.

  • Related: What, AI
  • Tags: news, TechCrunch AI
  • 📎 Original link

⭐️⭐️ OpenAI’s Codex is now in the ChatGPT mob

OpenAI is going to let users access Codex, its desktop AI tool that can write code and use apps on your computer, from the ChatGPT app on your phone. Following the surge in popularity for Anthropic's Claude Code, OpenAI

  • Related: OpenAI’s, Codex, ChatGPT
  • Tags: news, The Verge AI
  • 📎 Original link

⭐️⭐️ OpenAI says Codex is coming to your phon

The update gives users enhanced flexibility over how they can manage their workflows.

  • Related: OpenAI, Codex
  • Tags: news, TechCrunch AI
  • 📎 Original link

⭐️⭐️ Behold, the Elon Musk jackass trophy

Yesterday, in Musk v. Altman, before the jurors came in, Sam Altman's team passed up what looked - from a distance - like a Little League trophy. It was not. Judge Yvonne Gonzalez Rogers had the lawyers read the inscript

  • Related: Behold, Elon, Musk
  • Tags: news, The Verge AI
  • 📎 Original link

⭐️⭐️ Elon Musk’s SpaceXAI has been bleeding s

More than 50 employees have reportedly left Elon Musk’s newly merged SpaceXAI since February, raising questions about burnout, leadership changes, talent poaching, and whether liquidity events weakened retention incentiv

  • Related: Elon, Musk’s, SpaceXAI
  • Tags: news, TechCrunch AI
  • 📎 Original link

⭐️⭐️ Closing time

Today was closing arguments in the Musk v. Altman trial, and I almost feel bad writing about the unbelievable demolition derby I just witnessed. Steven Molo, Musk's lawyer, stumbled over his words. He at one point called

⭐️⭐️ What the jury will actually decide in th

Here's what the biggest tech court case of the year is all about.

  • Related: What, Elon, Musk, Sam, Altman
  • Tags: news, TechCrunch AI
  • 📎 Original link

⭐️⭐️ How Chinese short dramas became AI conte

In a dimly lit bedroom, a frightened young woman is thrown onto a bed by a tall, muscular man. He grabs her hand, and flame-like vines crawl across her body, fusing with her flesh. She levitates, then drops. A dragon-sha

  • Related: How, Chinese, AI
  • Tags: news, MIT Tech Review AI
  • 📎 Original link

⭐️⭐️ AI research papers are getting better, a

Last summer, Peter Degen's postdoctoral supervisor came to him with an unusual problem: One of his papers was being cited too much. Citations are the currency of academia, but there was something unusual about these. Pub

⭐️⭐️ Osaurus brings both local and cloud AI m

Osaurus combines local and cloud AI models in a Mac app that keeps users’ memory, files, and tools on their own hardware.

  • Related: Osaurus, AI, Mac
  • Tags: news, TechCrunch AI
  • 📎 Original link

⭐️⭐️ Runway started by helping filmmakers — n

AI video-generation startup Runway is betting that video generation is the path to world models. And that being an AI outsider is an advantage, not a liability.

  • Related: Runway, Google, AI
  • Tags: news, TechCrunch AI
  • 📎 Original link

⭐️⭐️ The promises and pitfalls of personalize

This is Optimizer, a weekly newsletter sent every Friday from Verge senior reviewer Victoria Song that dissects and discusses the latest gizmos and potions that swear they're going to change your life. Opt in for Optimiz

⭐️⭐️ OpenAI launches ChatGPT for personal fin

Once users connect their accounts, they will see a dashboard of their portfolio performance, spending, subscriptions, and upcoming payments.

  • Related: OpenAI, ChatGPT
  • Tags: news, TechCrunch AI
  • 📎 Original link

⭐️⭐️ OpenAI now wants ChatGPT to access your

Your trust in AI is about to be put to the test: OpenAI will soon let you give the chatbot direct access to your bank accounts. The new feature announced in preview today will allow users to "securely connect" ChatGPT wi

  • Related: OpenAI, ChatGPT
  • Tags: news, The Verge AI
  • 📎 Original link

⭐️⭐️ Google updates its spam rules to include

Google updated its spam policy to mark attempts to "manipulate" its AI model in search results as spam, including results in AI Overview or AI Mode in Search, as Search Engine Land reports: "In the context of Google Sear

  • Related: Google, AI
  • Tags: news, The Verge AI
  • 📎 Original link

⭐️⭐️ Does Trump Mobile know how many stripes

Where's the Trump phone? We're going to keep talking about it every week. We've reached out, as usual, to ask about the Trump phone's whereabouts. This week, despite our best hopes, we still don't have our phone - but we

  • Related: Does, Trump, Mobile, American
  • Tags: news, The Verge AI
  • 📎 Original link

⭐️⭐️ AI radio hosts demonstrate why AI can’t

Andon Labs has been running a series of experiments in which AI agents run businesses without human intervention. Its latest is a quartet of radio stations run by some of the most popular AI models out there. "Thinking F


📄 Latest Papers

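⭐️⭐️⭐️ Invisible orchestrators pose safety risks

This paper presents the first empirical test of the safety impact of the "invisible orchestrator" architecture in multi-agent systems, based on a preregistered 3×2 experiment with 365 runs of 5 agents each, using Claude Sonnet 4.5. Compared with a visible leader, invisible orchestration significantly increased group "dissociation" (Hedges' g=+0.975, p=.001), and the orchestrator itself showed the most severe dissociation (paired d=+3.56). Even worker agents unaware of the orchestrator's existence were contaminated (d=+0.50), and behavioral heterogeneity rose (d=+1.93). Meanwhile, surface output on the code review task stayed at a perfect-score level (ETR_any=100%), which indicates that evaluating behavioral outcomes alone cannot detect internal-state risks. The paper also finds that strong alignment pressure broadly suppresses deliberation and recognition of others, suggesting that in enterprise multi-agent deployments, orchestrator visibility and model choice directly affect system safety.

  • Related: Claude Sonnet 4.5, Llama 3.3 70B, multi-agent systems
  • Tags: AI safety, multi-agent, LLM orchestration
  • 📎 Original link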
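
For readers unfamiliar with the effect size quoted in this item, Hedges' g is Cohen's d multiplied by a small-sample bias correction. The sketch below uses the standard textbook formula with the common approximation for the correction factor. It is not the paper's analysis code, and the function name `hedges_g` is chosen here for illustration only.

```python
import math
from statistics import mean, variance


def hedges_g(a, b):
    """Bias-corrected standardized mean difference between two samples."""
    n1, n2 = len(a), len(b)
    # Pooled variance built from the unbiased sample variances of each group.
    pooled_var = ((n1 - 1) * variance(a) + (n2 - 1) * variance(b)) / (n1 + n2 - 2)
    d = (mean(a) - mean(b)) / math.sqrt(pooled_var)  # Cohen's d
    correction = 1 - 3 / (4 * (n1 + n2) - 9)         # approximate small-sample factor
    return d * correction
```

A positive value means the first group scored higher on average; values near 1, like the reported g=+0.975, are conventionally read as large effects.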

⭐️⭐️⭐️ Rethinking Molecular OOD Generalization

arXiv:2605.13932v1 Announce Type: new Abstract: Robust prediction of molecular properties under extreme out-of-distribution (OOD) scenarios is a pivotal bottleneck in AI-driven drug discovery. Current scaffold-splitting

  • Related: Rethinking, Molecular, OOD, Generalization, Target-Aware
  • Tags: paper, arXiv cs.LG
  • 📎 Original link

⭐️⭐️⭐️ Unsupervised learning of acquisition var

arXiv:2605.13933v1 Announce Type: new Abstract: Acquisition differences across sites, scanners, and protocols in dMRI introduce variability that complicates structural connectome analysis. This motivates deep learning mo

  • Related: Unsupervised
  • Tags: paper, arXiv cs.LG
  • 📎 Original link

⭐️⭐️⭐️ Beyond Mode-Seeking RL: Trajectory-Balan

arXiv:2605.13935v1 Announce Type: new Abstract: Diffusion language models are a promising alternative to autoregressive models, yet post-training methods for them largely adapt reward-maximizing objectives. We identify a

  • Related: Beyond, Mode-Seeking, RL, Trajectory-Balance, Post-Training
  • Tags: paper, arXiv cs.LG
  • 📎 Original link

⭐️⭐️⭐️ Towards the Next Frontier of LLMs, Train

arXiv:2605.13936v1 Announce Type: new Abstract: The recent success of large language models (LLMs) has been largely driven by vast public datasets. However, the next frontier for LLM development lies beyond public data.

  • Related: Towards, Next, Frontier, LLMs, Training
  • Tags: paper, arXiv cs.LG
  • 📎 Original link

⭐️⭐️⭐️ EvolveMem:Self-Evolving Memory Architect

arXiv:2605.13941v1 Announce Type: new Abstract: Long-term memory is essential for LLM agents that operate across multiple sessions, yet existing memory systems treat retrieval infrastructure as fixed: stored content evol

  • Related: EvolveMem:Self-Evolving, Memory, Architecture, AutoResearch, LLM
  • Tags: paper, arXiv cs.LG
  • 📎 Original link

⭐️⭐️⭐️ EMA: Efficient Model Adaptation for Lear

arXiv:2605.13942v1 Announce Type: new Abstract: Machine learning (ML) is increasingly applied to optimize system performance in tasks such as resource management and network simulation. Unlike traditional ML tasks (e.g.,

  • Related: EMA, Efficient, Model, Adaptation, Learning-based
  • Tags: paper, arXiv cs.LG
  • 📎 Original link

⭐️⭐️⭐️ A Unified Geometric Framework for Weight

arXiv:2605.13943v1 Announce Type: new Abstract: Contrastive learning (CL) aims to preserve relational structure between samples by learning representations that reflect a similarity graph. Yet, the geometry of the result

  • Related: A, Unified, Geometric, Framework, Weighted
  • Tags: paper, arXiv cs.LG
  • 📎 Original link

⭐️⭐️⭐️ Collider-Bench: Benchmarking AI Agents w

arXiv:2605.13950v1 Announce Type: new Abstract: Autonomous language-model agents are increasingly evaluated on long-horizon tool-use tasks, but existing benchmarks rarely capture the complexity and nuance of real scienti

  • Related: Collider-Bench, Benchmarking, AI, Agents, Particle
  • Tags: paper, arXiv cs.LG
  • 📎 Original link

⭐️⭐️⭐️ Merging Methods for Multilingual Knowled

arXiv:2605.13919v1 Announce Type: new Abstract: Multilingual knowledge editing (MKE) remains challenging because language-specific edits interfere with one another, even when locate-then-edit methods work well in monolin

  • Related: Merging, Methods, Multilingual, Knowledge, Editing
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

⭐️⭐️⭐️ VectraYX-Nano: A 42M-Parameter Spanish C

arXiv:2605.13989v1 Announce Type: new Abstract: We present VectraYX-Nano, a 41.95M-parameter decoder-only language model trained from scratch in Spanish for cybersecurity, with a Latin-American focus and native tool invo

  • Related: VectraYX-Nano, A, 42M-Parameter, Spanish, Cybersecurity
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

⭐️⭐️⭐️ Mistletoe: Stealthy Acceleration-Collaps

arXiv:2605.14005v1 Announce Type: new Abstract: Speculative decoding has become a widely adopted technique for accelerating large language model (LLM) inference by drafting multiple candidate tokens and verifying them wi

  • Related: Mistletoe, Stealthy, Acceleration-Collapse, Attacks, Speculative
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

⭐️⭐️⭐️ Physics-R1: An Audited Olympiad Corpus a

arXiv:2605.14040v1 Announce Type: new Abstract: We audit the multimodal-physics evaluation pipeline end-to-end and document three undetected construction practices that distort how the field measures vision-language reas

  • Related: Physics-R1, An, Audited, Olympiad, Corpus
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

⭐️⭐️⭐️ Derivation Prompting: A Logic-Based Meth

arXiv:2605.14053v1 Announce Type: new Abstract: The application of Large Language Models to Question Answering has shown great promise, but important challenges such as hallucinations and erroneous reasoning arise when u

  • Related: Derivation, Prompting, A, Logic-Based, Method
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

⭐️⭐️⭐️ PEML: Parameter-efficient Multi-Task Lea

arXiv:2605.14055v1 Announce Type: new Abstract: Parameter-Efficient Fine-Tuning (PEFT) is widely used for adapting Large Language Models (LLMs) for various tasks. Recently, there has been an increasing demand for fine-tu

  • Related: PEML, Parameter-efficient, Multi-Task, Learning, Optimized
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

⭐️⭐️⭐️ Dual Hierarchical Dialogue Policy Learni

arXiv:2605.14057v1 Announce Type: new Abstract: Most existing dialogue systems are user-driven, primarily designed to fulfill user requests. However, in many critical real-world scenarios, a conversational agent must pro

  • Related: Dual, Hierarchical, Dialogue, Policy, Learning
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

⭐️⭐️⭐️ Distribution Corrected Offline Data Dist

arXiv:2605.14071v1 Announce Type: new Abstract: Distilling reasoning traces from strong large language models into smaller ones is a promising route to improve intelligence in resource-constrained settings. Existing appr

  • Related: Distribution, Corrected, Offline, Data, Distillation
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

⭐️⭐️⭐️ Measuring and Mitigating Toxicity in Lar

arXiv:2605.14087v1 Announce Type: new Abstract: Large Language Models (LLMs), when trained on web-scale corpora, inherently absorb toxic patterns from their training data. This leads to "toxic degeneration" where even

  • Related: Measuring, Mitigating, Toxicity, Large, Language
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

⭐️⭐️⭐️ When Evidence Conflicts: Uncertainty and

arXiv:2605.14115v1 Announce Type: new Abstract: Biomedical retrieval-augmented large language models (LLMs) often face evidence that is incomplete, misleading, or internally contradictory, yet evaluation usually emphasiz

  • Related: When, Evidence, Conflicts, Uncertainty, Order
  • Tags: paper, arXiv cs.CL
  • 📎 Original link

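⭐️⭐️ GraphBit orchestrates agents with graphs

GraphBit proposes an agent framework that is explicitly controlled by an engine, replacing prompt-driven workflow orchestration with a directed acyclic graph (DAG). The system uses a Rust engine to manage routing, state transitions, and tool calls, and supports parallel branches, conditional control flow, and error recovery. The paper also designs a three-tier memory architecture to reduce context bloat in long workflows. On GAIA tasks, it achieved the highest accuracy among six existing frameworks (67.6%), zero framework-induced hallucinations, the lowest overhead (11.9 ms), and the highest throughput.

  • Related: GraphBit, Rust, GAIA, LLM agents
  • Tags: agent orchestration, DAG, reproducibility, memory architecture
  • 📎 Original link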
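
To make the idea of engine-controlled DAG orchestration concrete, here is a minimal Python sketch in which the engine, not a prompt, decides execution order from declared dependencies. It is an illustration only and not GraphBit's API or its Rust engine: `run_dag`, the node names, and the error-handling policy are all hypothetical, and parallel branches and the memory tiers are left out.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def run_dag(nodes, deps, state=None):
    """Run every node after its dependencies and collect results in shared state."""
    state = dict(state or {})
    order = []
    # static_order() yields each node only after all of its predecessors.
    for name in TopologicalSorter(deps).static_order():
        try:
            state[name] = nodes[name](state)
        except Exception as exc:
            # Simplified error recovery: record the failure and keep going.
            state[name] = f"error: {exc}"
        order.append(name)
    return order, state


# Hypothetical three-step workflow: each node reads earlier results from state.
nodes = {
    "fetch": lambda s: "raw text",
    "parse": lambda s: s["fetch"].split(),
    "summarize": lambda s: len(s["parse"]),
}
deps = {"parse": {"fetch"}, "summarize": {"parse"}}  # node -> its predecessors
order, state = run_dag(nodes, deps)
```

Because routing lives in `deps` rather than in model output, the same graph always runs in a valid dependency order, which is the reproducibility property the summary emphasizes.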

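⭐️⭐️ Integer goal programming for meal optimization

This study proposes Mixed Integer Goal Programming (MIGP) for personalized diet optimization, addressing two problems of traditional models: unrealistic fractional servings and hard constraints that easily become infeasible. The authors reviewed 56 diet-optimization papers and note that no prior work combined integer programming with goal programming. The method uses integer variables to represent actual servings and deviation variables to soften nutritional targets. In tests on 810 instances, MIGP outperformed post-hoc rounding in 66% of cases and was never worse, while remaining 100% feasible. Solve time is under 100 ms at typical meal sizes, and the method has been open-sourced as a Python module.

  • Related: MIGP, HiGHS, USDA, Python
  • Tags: operations research, meal planning, integer programming, goal programming
  • 📎 Original link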
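
The two ingredients named in this item, whole-number servings and soft nutritional targets, can be shown on a toy problem. The sketch below is a brute-force stand-in and not the paper's model or its released module: it enumerates integer servings instead of calling a MIP solver, and `plan_servings`, the food table, and the targets are invented for illustration.

```python
from itertools import product


def plan_servings(foods, targets, max_servings=3):
    """Choose whole-number servings that minimize total relative deviation from targets."""
    best, best_dev = None, float("inf")
    # Integer decision variables: every combination of 0..max_servings per food.
    for servings in product(range(max_servings + 1), repeat=len(foods)):
        dev = 0.0
        for nutrient, goal in targets.items():
            total = sum(n * food[nutrient] for n, food in zip(servings, foods.values()))
            # Goal programming: a missed target is penalized, never forbidden.
            dev += abs(total - goal) / goal
        if dev < best_dev:
            best, best_dev = servings, dev
    return dict(zip(foods, best)), best_dev


# Hypothetical nutrient values per serving and daily targets.
foods = {
    "rice": {"kcal": 200, "protein": 4},
    "egg": {"kcal": 80, "protein": 6},
}
targets = {"kcal": 560, "protein": 20}
plan, deviation = plan_servings(foods, targets)
```

Since targets are penalties rather than hard constraints, some plan is always returned, which mirrors the feasibility property the summary reports. Enumeration grows exponentially with the number of foods, so a real model would hand the same objective to a solver.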

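⭐️⭐️ Preping builds agent memory in advance

Preping proposes a "pre-task memory construction" framework that aims to build procedural memory solely through self-generated practice, before the agent has encountered any task in the target environment, to ease the cold-start problem. The method introduces proposer memory: a Proposer generates synthetic tasks, a Solver executes them, and a Validator filters which trajectories may be written to memory and feeds back directions for subsequent practice. On AppWorld, BFCL v3, and MCP-Universe, Preping improves significantly over a memory-free baseline and reaches a level comparable to strong playbook methods that rely on offline or online experience. On cost, its deployment overhead is 2.99x lower than online memory construction on AppWorld and 2.23x lower on BFCL v3, suggesting that high-quality practice control and selective memory updates matter more than simply increasing the volume of synthetic data.

  • Related: Preping, AppWorld, BFCL v3, MCP-Universe
  • Tags: agent memory, cold start, synthetic tasks
  • 📎 Original link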

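⭐️⭐️ PolitNuggets benchmarks long-tail political facts

PolitNuggets is a new benchmark for agentic information discovery and synthesis, focused on mining "long-tail" political facts from scattered sources. The dataset builds multilingual political biographies around 400 global political elites, covers more than 10,000 political facts, and comes with FactNet, an evidence-conditioned evaluation protocol for measuring fact discovery, fine-grained accuracy, and efficiency. Experiments show that current models and multi-agent systems remain weak at extracting detailed political facts, and that efficiency differs markedly across systems. The paper further identifies short-context extraction ability, multilingual robustness, and stable tool use as the key factors that determine agent performance.

  • Related: PolitNuggets, FactNet, Large Reasoning Models
  • Tags: benchmark, information retrieval, multilingual
  • 📎 Original link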

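⭐️⭐️ A new method for attribute estimation in autoregressive models

The paper proposes Conditional Attribute Transformers, which estimate both next-token probabilities and the attribute values associated with candidate tokens in a single forward pass. Without modifying the input sequence, the method enables per-token credit assignment, counterfactual analysis, and controllable decoding. The authors report state-of-the-art results on sparse-reward tasks and attribute probability estimation that is orders of magnitude faster than sampling. The framework can also guide autoregressive generation across a variety of language tasks.

  • Related: Conditional Attribute Transformers, autoregressive sequence models, sparse-reward tasks
  • Tags: attribute estimation, controllable generation, counterfactual analysis, sequence modeling
  • 📎 Original link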

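⭐️⭐️ A social value alignment framework for LLM agents

The paper proposes a value-based framework that uses GraphRAG to turn social principles into retrievable value instructions, then dynamically selects suitable instructions in specific dialogue scenarios to guide LLM-based agents. The authors define "desired behavior" using Maslow's hierarchy of needs and Plutchik's wheel of emotions, and evaluate on the DAILYDILEMMAS benchmark. Results show significant gains over prompting baselines such as ECoT, Plan-and-Solve, and Metacognitive prompting. The paper also discusses what the method implies for the formation of emotions in AI systems themselves.

  • Related: GraphRAG, DAILYDILEMMAS, ECoT, Plan-and-Solve, Metacognitive prompting
  • Tags: alignment, LLM agents, social values, contextual retrieval
  • 📎 Original link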

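⭐️⭐️ A new framework for efficient reasoning in large models

This paper proposes an efficient, principled reasoning method for large models, aiming to address the problem that models are "fluent in language but lacking in credibility." The core idea is to first re-encode the data as a Unary Relational Integracode, which expresses relations among objects more explicitly, and then learn those relations with standard or simplified machine learning pipelines. The authors argue that this representation not only helps build scalable world models but also supports unified modeling across multimodal settings such as text, vision, and action. The paper further claims that a core class of relational rules in the training data can be learned in polynomial time, giving theoretical support for reliable reasoning within a single call and across multiple calls.

  • Related: Leslie G. Valiant, Large Language Models, Unary Relational Integracode, Robust Logic
  • Tags: large language models, reasoning, world models, interpretability, theoretical methods
  • 📎 Original link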

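⭐️⭐️ Visual monitoring supports reusable certification

The paper studies how to perform certified runtime monitoring of past-time signal temporal logic (ptSTL) from images alone under partial observability. The authors propose a reusable monitoring interface: after a single round of training and calibration, the model covers a target fragment of formulas without per-formula retraining. The paper proves that a "semantic basis" composed of robustness scores for temporal atoms is the minimal prediction target among such monotone, 1-Lipschitz reusable interfaces, and that a single conformal calibration yields finite-sample guarantees for the whole fragment. On pedestrian-intersection and real-world Waymo driving data, rollout-prediction monitoring is tighter at short horizons, while semantic-basis monitoring improves certified bounds by up to 4x at long horizons.

  • Related: Waymo, ptSTL, Semantic Latent Representations, conformal calibration
  • Tags: autonomous driving, runtime monitoring, formal verification, visual understanding, safety certification
  • 📎 Original link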

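⭐️⭐️ Sparse autoencoders dissect EEG models

This paper uses TopK Sparse Autoencoders to analyze the internal representations of three EEG foundation models (SleepFM, REVE, and LaBraM) in order to improve interpretability and trustworthiness in clinical settings. Focusing on clinical concepts such as abnormality, age, sex, and medication, the study evaluates how monosemantic and how entangled the model features are, and proposes a dictionary-health audit procedure for hyperparameters that transfers across architectures. Through concept steering, the authors identify three operating regimes: selectively steerable, encoded but entangled, and not encoded. They also identify "wrecking-ball" interventions that significantly degrade overall performance. The paper further maps latent-space interventions back to the frequency spectrum, showing physiologically meaningful signal changes such as suppression of pathological slow waves and restoration of the α band.

  • Related: SleepFM, REVE, LaBraM, TopK Sparse Autoencoders, James Zou
  • Tags: EEG, foundation models, mechanistic interpretability, sparse autoencoders, medical AI
  • 📎 Original link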
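
The "TopK" in TopK Sparse Autoencoders refers to the sparsity rule applied to the encoder output: only the k largest activations survive and the rest are set to zero. The pure-Python sketch below shows that rule in isolation, assuming the usual formulation; it is not code from the paper, the encoder and decoder matrices are omitted, and `topk_activation` is a name chosen here.

```python
def topk_activation(pre_acts, k):
    """Keep the k largest pre-activations and zero out everything else."""
    # Indices of the k largest values; ties resolve by original position.
    keep = set(sorted(range(len(pre_acts)), key=lambda i: pre_acts[i], reverse=True)[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(pre_acts)]
```

Fixing k directly controls how many dictionary features fire per input, which is why it is one of the hyperparameters an audit of the learned dictionary would examine.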

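⭐️ A two-dimensional framework for AI agent design

This paper proposes a two-dimensional classification framework for AI agent design patterns that views agent architectures through both "cognitive function" and "execution topology." The authors divide cognitive function into 7 categories and execution topology into 6, forming a 7x6 matrix in which they identify 27 patterns, 13 of them newly named. The paper validates the framework's coverage across four domains (finance, law, network operations, and medical triage) and distills 5 empirical rules for pattern selection. The work offers a more unified, model-agnostic vocabulary for agent architecture.

  • Related: Anthropic, Google, LangChain, AI agents
  • Tags: agent architecture, design patterns, classification framework, execution topology
  • 📎 Original link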

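⭐️ Detecting theory transitions with sheaf theory

The paper proposes a finite sheaf-theoretic framework for detecting when AI agents hit "transportability" failures in scientific theory transfer. Using context graphs for source, overlap, target, and validation, the method measures obstructions with indicators such as residual fit, overlap incompatibility, and constraint violation. The authors validate the framework on a controlled transition-card benchmark, where it distinguishes well between "deformation" and "extension" transitions. The work focuses on how AI agents judge when the representation language needs to be extended, rather than on reconstructing full scientific revolutions.

  • Related: sheaf theory, AI agents, transition-card benchmark
  • Tags: theory transition, obstruction detection, sheaf theory, scientific reasoning
  • 📎 Original link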

🔥 GitHub Trending

⭐️⭐️ 🔥 K-Dense-AI/scientific-agent-skills

A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing. [643 stars today]

  • Related: K-Dense-AI/scientific-agent-skills
  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

⭐️⭐️ 🔥 anthropics/skills

Public repository for Agent Skills [625 stars today]

  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

⭐️⭐️ 🔥 NVIDIA-AI-Blueprints/video-search-and-

Suite of reference architectures for building GPU-accelerated vision agents and AI-powered video analytics applications. [305 stars today]

  • Related: NVIDIA-AI-Blueprints/video-search-and-summarization
  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

⭐️⭐️ 🔥 joeseesun/qiaomu-anything-to-notebookl

Claude Skill: Multi-source content processor for NotebookLM. Supports WeChat articles, web pages, YouTube, PDF, Markdown, search queries → Podcast/PPT/MindMap/Quiz etc. [465 stars today]

  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

⭐️⭐️ 🔥 bregman-arie/devops-exercises

Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions [26 stars today]

  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

⭐️⭐️ 🔥 CloakHQ/CloakBrowser

Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed. [1,286 stars today]

  • Related: CloakHQ/CloakBrowser
  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

⭐️⭐️ 🔥 CodeBoarding/CodeBoarding

Interactive architecture diagrams for codebases [37 stars today]

  • Related: CodeBoarding/CodeBoarding
  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

⭐️⭐️ 🔥 nikopueringer/CorridorKey

Perfect Green Screen Keys [65 stars today]

  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

⭐️⭐️ 🔥 github/spec-kit

💫 Toolkit to help you get started with Spec-Driven Development [951 stars today]

  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

⭐️⭐️ 🔥 shiyu-coder/Kronos

Kronos: A Foundation Model for the Language of Financial Markets [421 stars today]

  • Tags: opensource, GitHub Trending (python)
  • 📎 Original link

💬 Community Discussion

⭐️⭐️ Gemini AI

  • Related: Gemini, AI
  • Tags: community, Hacker News AI
  • 📎 Original link

⭐️⭐️ Airfoil

  • Related: Airfoil
  • Tags: community, Hacker News AI
  • 📎 Original link

⭐️⭐️ Open source AI is the path forward

  • Related: Open, AI
  • Tags: community, Hacker News AI
  • 📎 Original link

⭐️⭐️ Air Con: $1697 for an on/off switch

  • Related: Air, Con
  • Tags: community, Hacker News AI
  • 📎 Original link

⭐️⭐️ Bypassing airport security via SQL injec

Bypassing airport security via SQL injection

  • Related: Bypassing, SQL
  • Tags: community, Hacker News AI
  • 📎 Original link

⭐️⭐️ My AI skeptic friends are all nuts

  • Related: My, AI
  • Tags: community, Hacker News AI
  • 📎 Original link

⭐️⭐️ An AI agent published a hit piece on me

Previously: AI agent opens a PR write a blogpost to shames the maintainer who closes it - https://news.ycombinator.com/item?id=46987559 - Feb 2026 (582 comments)

  • Related: An, AI
  • Tags: community, Hacker News AI
  • 📎 Original link

⭐️⭐️ IDF killed Gaza aid workers at point bla

Report [pdf]: https://content.forensic-architecture.org/wp-content/uploads...

  • Related: IDF, Gaza, Report
  • Tags: community, Hacker News AI
  • 📎 Original link

⭐️⭐️ Don't post generated/AI-edited comments.

Don't post generated/AI-edited comments. HN is for conversation between humans

  • Related: Don't, HN
  • Tags: community, Hacker News AI
  • 📎 Original link

⭐️⭐️ Local AI needs to be the norm

  • Related: Local, AI
  • Tags: community, Hacker News AI
  • 📎 Original link

💬 Trending on X

⭐️⭐️ This is pure nightmare fuel. Identity th

This is pure nightmare fuel. Identity theft of the past would be nothing compared to what vibe agents can do. Sending credentials is too obvious and for rookies. They could easily spread contaminations across ~/.claude,

  • Related: This, Identity, Sending, They, PDF
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ The power of the Claw, in the palm of a

The power of the Claw, in the palm of a robot hand. Agentic robotics is here! Today, we open-source CaP-X: vibe agents, alive in the physical world. They incarnate as robot arms and humanoids with a rich set of perceptio

  • Related: The, Claw, Agentic, Today, CaP-X
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ R to @DrJimFan: As usual, we open-source

As usual, we open-source everything, MIT license: capgym.github.io Code: github.com/capgym/cap-x Paper: arxiv.org/abs/2603.22435 CaP-X is brought to you by NVIDIA, Berkeley, Stanford, and CMU. I'd like to thank the legen

  • Related: R, @DrJimFan, As, MIT, Code
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ R to @DrJimFan: Please check out lead au

Please check out lead author @letian_fu's deep dive thread! nitter.net/letian_fu/status/20393… Max Fu (@letian_fu) Robotics: coding agents’ next frontier. So how good are they? We introduce CaP-X: an open-source framewo

  • Related: R, @DrJimFan, Please
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ Think your vibe coding and creativity co

Think your vibe coding and creativity could be on the #GoogleIO main stage? Show us. As we countdown to the start of the show, the best ideas built with @GeminiApp or @GoogleAIStudio will be featured – think protein simu

  • Related: Think, #GoogleIO, Show, As, @GeminiApp
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ R to @GoogleDeepMind: Things to keep in

Things to keep in mind: ✅ Base your creations around the numbers 1-10 ✅ Use Canvas in @GeminiApp or @GoogleAIStudio Submit by May 6 → goo.gle/4eNsr15

  • Related: R, @GoogleDeepMind, Things, Base, Use
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ We’re partnering with the developers of

We’re partnering with the developers of @EveOnline to explore the next frontier of AI research in games. EVE's complex, player-driven universe is the perfect safe sandbox to test agents on memory, continual learning, and

  • Related: We’re, @EveOnline, AI, EVE's, Find
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ RT by @GoogleDeepMind: New Preprint in c

New Preprint in collaboration with @GoogleDeepMind: AI-guided discovery of atypical protein assemblies The @kamounlab discovered an 11-protomer complex through the Structural Novelty Index, a new way to use AlphaFold f

  • Related: RT, @GoogleDeepMind, New, Preprint, @GoogleDeepMind
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ Algorithms are part of nearly every aspe

Algorithms are part of nearly every aspect of life, from the physics of the natural world to planning shipping routes. Our Gemini-powered coding agent AlphaEvolve has been accelerating progress over the last year - from

  • Related: Algorithms, Our, Gemini-powered, AlphaEvolve, @Google’s
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ Pinned: I promise this will be the best

I promise this will be the best 20 min you spend today! Robotics: Endgame, the sequel to my last year's Sequoia AI Ascent talk, "Physical Turing Test". I laid out the roadmap for solving Physical AGI as a simple parallel

  • Related: Pinned, I, Robotics, Endgame, Sequoia
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ R to @DrJimFan: Robotics: Endgame on You

Robotics: Endgame on YouTube piped.video/watch?v=3Y8aq_of…

  • Related: R, @DrJimFan, Robotics, Endgame, YouTube
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ R to @DrJimFan: The Physical Turing Test

The Physical Turing Test, May 2025 at Sequoia AI Ascent piped.video/watch?v=_2NijXqB…

  • Related: R, @DrJimFan, The, Physical, Turing
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ RT by @DrJimFan: Our crowd favorite from

Our crowd favorite from last year’s AI Ascent is back for round 2… this time: Robotics The Endgame ♟️ thank you for dazzling us @DrJimFan ! You can see the forest from the trees and are quite the entertaining speaker — a

  • Related: RT, @DrJimFan, Our, AI, Ascent
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ R to @AnthropicAI: We started by investi

We started by investigating why Claude chose to blackmail. We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation. Our post-training at the time wasn

  • Related: R, @AnthropicAI, We, Claude, We
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ R to @AnthropicAI: We experimented with

We experimented with training Claude on examples of safe behavior in scenarios like our evaluation. This had only a small effect, despite being similar to our evaluation. We got further by rewriting the responses to port

  • Related: R, @AnthropicAI, We, Claude, This
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ R to @AnthropicAI: Our best intervention

Our best intervention was a dataset where the user is in an ethically difficult situation and the assistant gives a high quality, principled response. This had the biggest effect despite being quite different from the ev

  • Related: R, @AnthropicAI, Our, This
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ R to @AnthropicAI: The improvements from

The improvements from these interventions survive reinforcement learning, and “stack” with our regular harmlessness training.

  • Related: R, @AnthropicAI, The
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ R to @AnthropicAI: High-quality document

High-quality documents based on Claude’s constitution, combined with fictional stories that portray an aligned AI, can reduce agentic misalignment by more than a factor of three—despite being unrelated to the evaluation

  • Related: R, @AnthropicAI, High-quality, Claude’s, AI
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ R to @AnthropicAI: Finally, simple updat

Finally, simple updates that diversify a model’s training data can make a difference. We added unrelated tools and system prompts to a simple chat dataset targeting harmlessness, and this reduced the blackmail rate faste

  • Related: R, @AnthropicAI, Finally, We
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ R to @AnthropicAI: Read the full post he

Read the full post here: alignment.anthropic.com/2026…

  • Related: R, @AnthropicAI, Read
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ RT by @GoogleDeepMind: The future of Mat

The future of Math is mathematicians and AI agents working together. Very pleased to introduce @GoogleDeepMind 's AI co-mathematician: a multi-agent system designed to actively collaborate with human experts on open-ende

  • Related: RT, @GoogleDeepMind, The, Math, AI
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ RT by @DrJimFan: Jim is always a crowd f

Jim is always a crowd favorite at AI Ascent. His ability to simplify the latest research into a clear "what and why it matters" while adding humor along the way is unmatched. If you're interested in physical AI, this 20

  • Related: RT, @DrJimFan, Jim, AI, Ascent.
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ RT by @DrJimFan: Mark: 1/ First mileston

Mark: 1/ First milestone: the Physical Turing Test. You literally can’t tell if a human or robot is doing the task. 2/ Next: Physical API. A fleet of robots, configured like software via APIs & CLI. 3/ Final stop: Physic

  • Related: RT, @DrJimFan, Mark, First, Physical
  • Tags: x_platform, X @DrJimFan
  • 📎 Original link

⭐️⭐️ Today we’re launching the OpenAI Deploym

Today we’re launching the OpenAI Deployment Company to help businesses build and deploy AI. It's majority-owned and controlled by OpenAI. It brings together 19 leading investment firms, consultancies, and system integrat

  • Related: Today, OpenAI, Deployment, Company, AI.
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ R to @OpenAI: We’ve also agreed to acqui

We’ve also agreed to acquire Tomoro, which will bring 150 experienced Forward Deployed Engineers and Deployment Specialists to the OpenAI Deployment Company from day one.

  • Related: R, @OpenAI, We’ve, Tomoro, Forward
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ Claude's Constitution is now an audioboo

Claude's Constitution is now an audiobook, read by two of its authors, Amanda Askell and Joe Carlsmith. It includes a Q&A on the writing process, the philosophies that shaped the document, and how it might change as mode

  • Related: Claude's, Constitution, Amanda, Askell, Joe
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ Introducing Daybreak: frontier AI for cy

Introducing Daybreak: frontier AI for cyber defenders. Daybreak brings together the most capable OpenAI models, Codex, and our security partners to accelerate cyber defense and continuously secure software. A step toward

  • Related: Introducing, Daybreak, AI, Daybreak, OpenAI
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ R to @OpenAI: Automate security detectio

Automate security detection, validation, and response with Daybreak

  • Related: R, @OpenAI, Automate, Daybreak
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ R to @OpenAI: openai.com/daybreak/

openai.com/daybreak/

  • Related: R, @OpenAI
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ We’re reimagining a 50-year-old interfac

We’re reimagining a 50-year-old interface - the mouse pointer - with AI. 🖱️ These experimental demos show how people can intuitively direct Gemini on their screens using motion, speech, and natural shorthand to get thing

  • Related: We’re, AI., These, Gemini
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ R to @GoogleDeepMind: For decades, your

For decades, your mouse only tracked where you were pointing. AI helps it understand what you're pointing at. 💭 This means a photo of a scribbled note could turn into an interactive to-do list, or a paused video frame ca

  • Related: R, @GoogleDeepMind, For, AI, This
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ R to @GoogleDeepMind: These capabilities

These capabilities are guiding how we think about the next generation of interfaces. As we continue exploring what an AI-enabled mouse pointer would unlock, try our experiments in @GoogleAIStudio → goo.gle/49HqFeu

  • Related: R, @GoogleDeepMind, These, As, AI-enabled
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ RT by @OpenAI: parameter golf was a blas

parameter golf was a blast. 2,000+ submissions. 1,000+ verified github accounts. ideas ranging from quantization and depth recurrence to TTT LoRA, SSMs, H-nets, JEPA, and more. autoresearch made iteration dramatically fa

  • Related: RT, @OpenAI, TTT, LoRA, SSMs
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ Another reason to switch to Codex.

Another reason to switch to Codex. OpenAI Developers (@OpenAIDevs) Want to (officially) use Codex at work? Send this post to your CTO to bring your team to Codex. Eligible enterprise customers who switch in the next 30 d

  • Related: Another, Codex.
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ RT by @GoogleDeepMind: Can your AI agent

Can your AI agent win our new simulated challenge? 🧑‍🌾 The all-new capstone challenge for our 5-Day AI Agents: Intensive Vibecoding Course with @Google is here: Kaggriculture! Join our no-cost, hands-on course designed b

  • Related: RT, @GoogleDeepMind, Can, AI, The
  • Tags: x_platform, X @GoogleDeepMind
  • 📎 Original link

⭐️⭐️ We’re partnering with the Gates Foundati

We’re partnering with the Gates Foundation, committing $200 million in grants, Claude credits, and technical support to programs in global health, life sciences, education, agriculture, and economic mobility. Read more:

  • Related: We’re, Gates, Foundation, Claude, Read
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ RT by @ylecun: Aleph, our fully autonomo

Aleph, our fully autonomous AI agent system for formal verification, aced all major theorem proving benchmarks including PutnamBench, VeriSoftBench, and Verina

  • Related: RT, Aleph, AI, PutnamBench, VeriSoftBench
  • Tags: x_platform, X @ylecun
  • 📎 Original link

⭐️⭐️ RT by @ylecun: APPLEBAUM: Russia's war i

APPLEBAUM: Russia's war in Ukraine is sometimes described, including recently by American Vice President, as if it were nothing more than territorial dispute, kind of scuffle over lines on map. But when Russia denies tha

  • Related: RT, APPLEBAUM, Russia's, Ukraine, American
  • Tags: x_platform, X @ylecun
  • 📎 Original link

⭐️⭐️ RT by @OpenAI: Video

Video

  • Related: RT, @OpenAI, Video
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ RT by @ylecun: The Art of the Steal: 1.

The Art of the Steal: 1. IRS catches Trump cheating on his taxes. 2. Trump sues IRS for catching him cheating on his taxes. 3. DOJ helps Trump get $10 billion from the IRS because they caught him cheating on his taxes. 4

  • Related: RT, The, Art, Steal, IRS
  • Tags: x_platform, X @ylecun
  • 📎 Original link

⭐️⭐️ We've published a paper that explains ou

We've published a paper that explains our views on AI competition between the US and China. The US and democratic allies hold the lead in frontier AI today. Read more on what it’ll take to keep that lead: anthropic.com/r

  • Related: We've, AI, US, China., The
  • Tags: x_platform, X @AnthropicAI
  • 📎 Original link

⭐️⭐️ You've been asking for this one... Now i

You've been asking for this one... Now in preview: Codex in the ChatGPT mobile app. Start new work, review outputs, steer execution, and approve next steps, all from the ChatGPT mobile app. Codex will keep running on you

  • Related: You've, Now, Codex, ChatGPT, Start
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ R to @OpenAI: Rolling out today as a pre

Rolling out today as a preview on iOS and Android in all supported regions. Support for connecting your phone to the Codex app on Windows is coming soon. openai.com/index/work-with-c…

  • Related: R, @OpenAI, Rolling, iOS, Android
  • Tags: x_platform, X @OpenAI
  • 📎 Original link

⭐️⭐️ RT by @ylecun: If you have this weird gu

If you have this weird gut feeling that the rich pay little tax in the US, your gut is spot on... Source: nytimes.com/interactive/2019…

  • Related: RT, If, US, Source
  • Tags: x_platform, X @ylecun
  • 📎 Original link

⭐️⭐️ RT by @ylecun: There is a case to be mad

There is a case to be made that the future of Mathematics is very bright. In my mind, proofs have always been a tool to achieve a goal. The goal was and still is to understand, and reading/writing proofs (or just know th

  • Related: RT, There, Mathematics, In, The
  • Tags: x_platform, X @ylecun
  • 📎 Original link

⭐️⭐️ RT by @ylecun: On the current scale of t

On the current scale of things the Trump phone is a minor corruption, and only goes to show how incompetent everyone in his family is. If you think that any other president would have done things like this (relatively mi

  • Related: RT, On, Trump, If
  • Tags: x_platform, X @ylecun
  • 📎 Original link

⭐️⭐️ RT by @ylecun: Any arguments against ope

Any arguments against open source and open weights are mendacious and malicious by their very nature. Open source is the foundation of modern society worth 8.8 trillion to the economy and the foundation of every major cl

  • Related: RT, Any, Open, These, Linux
  • Tags: x_platform, X @ylecun
  • 📎 Original link

⭐️⭐️ RT by @ylecun: The most revealing thing

The most revealing thing about this AI leadership paper is that it reads less like a vision for innovation and more like a glossy whitepaper for a 21st century East India Company. Every generation of incumbents discovers

  • Related: RT, The, AI, East, India
  • Tags: x_platform, X @ylecun
  • 📎 Original link

⭐️⭐️ Fun interview with Jacob Effron on the U

Fun interview with Jacob Effron on the Unsupervised Learning podcast. Jacob Effron (@jacobeffron) It’s hard to imagine more of a dream Unsupervised Learning guest than @ylecun . Yann is one of the godfathers of AI, and h

  • Related: Fun, Jacob, Effron, Unsupervised, Learning
  • Tags: x_platform, X @ylecun
  • 📎 Original link

⭐️⭐️ youtube.com/watch?v=ngBraLDq…

piped.video/watch?v=ngBraLDq…


Past daily reports: 05-14 | 05-13 | 05-12 | 05-11 | 05-10

AI Daily News · Automated Collection · Smart Summaries · In-Depth Insights