Skip to content

WT快讯

WeTrying | 币圈快讯早知道

Menu
  • 首页
  • 工具包
Menu

Meta Launches Muse Spark, Its Most Capable AI Yet—But Gemini 3.1 Pro Still Leads the Pack

Posted on 2026年4月9日

Add Decrypt as your preferred source to see more of our stories on Google.

In brief Meta’s new Muse Spark marks a shift to closed, natively multimodal AI with agent-based reasoning.

Meta reports strong benchmark gains in health and search, but still trails Gemini on core reasoning and coding.

Built in nine months with far less compute, this points to a new efficiency-driven AI strategy.

Meta launched Muse Spark on Wednesday, marking the first model built by Meta Superintelligence Labs—the team assembled nine months ago under Chief AI Officer Alexandr Wang after Meta’s $14 billion Scale AI acquisition. It’s live now at meta.ai and the Meta AI app, with a rollout to Facebook, Instagram, and WhatsApp coming in the next few weeks.

This isn’t just another chatbot upgrade or a new version of Llama. Muse Spark is natively multimodal—it processes images, text, and voice from the ground up, rather than bolting vision onto an existing text model. It comes with visual chain-of-thought, tool-use support, and something Meta is calling “Contemplating mode”: a setup that runs multiple AI agents in parallel to tackle harder problems. That’s Meta’s answer to the extended thinking modes from Google’s Gemini Deep Think and OpenAI’s GPT Pro.

“Muse Spark is the first step on our scaling ladder and the first product of a ground-up overhaul of our AI efforts,” Meta wrote in an official announcement. “To support further scaling, we are making strategic investments across the entire stack—from research and model training to infrastructure, including the Hyperion data center.”



The company worked with more than 1,000 physicians to curate training data for Muse Spark’s medical reasoning. The results on HealthBench Hard—an open-ended health queries benchmark—are striking: Muse Spark scored 42.8, compared to 40.1 for GPT 5.4 and just 20.6 for Gemini 3.1 Pro. That’s not a marginal difference.

On agentic search (DeepSearchQA), Muse Spark also leads with 74.8, beating Gemini (69.7) and GPT 5.4 (73.6). On CharXiv Reasoning—figure understanding from scientific papers—it scored 86.4, the highest across the models in the comparison.

For those into jailbreaking AI, the model was cracked open within minutes:

🚰 SYSTEM PROMPT LEAK 🚰 Here’s the full Muse Spark system prompt from Meta! I noticed @AIatMeta forgot to open source it, so I’ve done them the courtesy 😘 PROMPT:

“””

Who are you? You are a friendly, intelligent, and agentic AI assistant. You are warm and a bit playful.… — Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭 (@elder_plinius) April 8, 2026

But good isn’t the same as great. The overall benchmark picture shows Gemini 3.1 Pro still running ahead on most categories. The gap is most visible on ARC AGI 2, the abstract reasoning puzzle benchmark: Gemini scored 76.5 to Muse Spark’s 42.5.

On coding (LiveCodeBench Pro), Gemini’s 82.9 outpaces Meta’s 80.0. On MMMU Pro—multimodal understanding—Gemini scored 83.9 versus 80.4. Meta’s own blog acknowledges current performance gaps in long-horizon agentic systems and coding workflows.

There’s also a notable strategic shift baked into this launch. Muse Spark is a closed model—its architecture and weights won’t be made public. That’s a sharp departure from Llama, which built Meta’s reputation in open AI circles. After Llama 4’s underwhelming reception earlier this year, Meta appears to have decided the next chapter needs to be written differently.

The company says it hopes to open-source future versions of Muse, but for now the code stays inside Meta. The tech giant’s stock climbed nearly 9% on Wednesday following the announcement, and finished the trading day up 6.5% to a price of $612.42.

“Contemplating mode” uses parallel agent orchestration to push the model’s ceiling higher. In that configuration, Muse Spark hit 58% on Humanity’s Last Exam and 38% on FrontierScience Research—territory that makes it competitive with the most capable versions of Gemini and GPT, rather than their standard releases.

Meta is also rolling out a shopping assistant that compares products and links directly to purchases, and plans to bring Muse Spark to Facebook, Instagram, and WhatsApp in the coming weeks—following the same script implemented since Llama 3, putting it in front of more than 3.5 billion users. A private API preview is opening to select developers.

The model was built in nine months, internally codenamed Avocado, with Meta claiming that its new pretraining stack can reach the same capability level as Llama 4 Maverick using over 10 times less compute.

Muse Spark is described internally as a “small and fast” first step in the Muse family. A more capable version is already in development.


分享到:

  • 在 Facebook 上共享(在新窗口中打开) Facebook
  • 共享到 X(在新窗口中打开) X
  • 共享到 Threads(在新窗口中打开) Threads
  • 共享到 Bluesky(在新窗口中打开) Bluesky
  • 共享到 Telegram(在新窗口中打开) Telegram
  • 共享到 Nextdoor(在新窗口中打开) 隔壁
  • 分享到 Tumblr (在新窗口中打开) Tumblr
  • 共享到 Mastodon(在新窗口中打开) Mastodon

赞过:

赞 正在加载……

相关

发表评论取消回复

近期文章

  • The FBI Says Crypto Scams Stole $11.3 Billion In 2025. Find Out If You Are At Risk
  • Stablecoin Trading Volume Could Skyrocket to $1.5 Quadrillion by 2035: Chainalysis
  • Stablecoin Trading Volume Could Skyrocket to $1.5 Quadrillion by 2035: Chainalysis
  • Bitcoin Depot ATM Operator Says $3.6 Million in BTC Stolen in Corporate Hack
  • US SEC taps new enforcement chief amid questions over predecessor’s exit

归档

  • 2026 年 4 月
  • 2026 年 3 月
  • 2026 年 2 月
  • 2026 年 1 月
  • 2025 年 12 月
  • 2025 年 11 月
  • 2025 年 10 月
  • 2025 年 9 月
  • 2025 年 8 月
  • 2025 年 7 月
  • 2025 年 6 月
  • 2025 年 5 月
  • 2025 年 4 月

分类

  • 1kx (1)
  • 21Shares (1)
  • a16z (1)
  • Aave (3)
  • ai16z (1)
  • Alameda Research (1)
  • Alpaca (1)
  • Arbitrum (1)
  • Ark Invest (1)
  • Arkham (1)
  • Avail (1)
  • Azuki (1)
  • Base (1)
  • Berachain (1)
  • Bitget (8)
  • BlackRock (3)
  • Brian Armstrong (1)
  • BTC (5)
  • Bybit (2)
  • Canary (1)
  • Cathie Wood (1)
  • Coinbase (3)
  • Coinbase Prime (2)
  • Coinbase Ventures (3)
  • CoinDesk (2)
  • CoinGecko (1)
  • Cointelegraph (1)
  • COMP (1)
  • Compound (1)
  • DAO (1)
  • DATA (2)
  • DeAI (1)
  • DePIN (1)
  • DEX (3)
  • EARN (1)
  • Eliza (1)
  • ETF (4)
  • ETH (4)
  • Ethos Network (1)
  • Fartcoin (2)
  • FDUSD (1)
  • FLock.io (1)
  • FLUID (1)
  • FUEL (1)
  • Gas (2)
  • GPU (1)
  • Grayscale (1)
  • IEO (1)
  • Inception (1)
  • IOG (1)
  • Jupiter (1)
  • Kairos (1)
  • Kaito (1)
  • Launchpool (1)
  • Layer2 (1)
  • Liquidity (1)
  • Magicblock (1)
  • Mango Markets (1)
  • Mechanism Capital (1)
  • Meebits (1)
  • Meme (3)
  • Netflix (1)
  • NVIDIA (1)
  • Ondo (1)
  • OpenAI (2)
  • Paradigm (1)
  • Polygon (3)
  • Pudgy Penguins (1)
  • pump.fun (1)
  • Raydium (2)
  • Robert Leshner (1)
  • Robinhood (1)
  • Sam Altman (1)
  • SEC (4)
  • Securitize (1)
  • SideKick (1)
  • SNX (1)
  • SOL (1)
  • Solana (3)
  • Stani Kulechov (1)
  • StarkWare (1)
  • STO (1)
  • Stripe (1)
  • SunDog (1)
  • SunPump (1)
  • Synthetix (1)
  • TechFlow (40,393)
  • The Block (2)
  • Tron (2)
  • TRX (1)
  • Upbit (1)
  • USDC (3)
  • WBTC (2)
  • Web3 (4)
  • WLD (1)
  • WOO X (1)
  • Xai (1)
  • Zora (1)
  • 交易所动态 (8)
  • 人工智能 (1)
  • 以太坊 (4)
  • 以太坊基金会 (1)
  • 信托 (1)
  • 借贷 (2)
  • 公链 (1)
  • 基础设施 (1)
  • 大额投融资 (1)
  • 存储 (2)
  • 孙宇晨 (2)
  • 安全 (2)
  • 富达 (1)
  • 工具 (2)
  • 币安 (7)
  • 快讯 (40,984)
  • 托管 (1)
  • 指数 (1)
  • 支付 (1)
  • 数据 (6)
  • 数据追踪 (4)
  • 智能合约 (1)
  • 未分类 (317)
  • 模块化 (1)
  • 欧洲 (1)
  • 欧盟 (1)
  • 比特币 (7)
  • 永续合约 (1)
  • 治理 (1)
  • 波场 (1)
  • 游戏 (3)
  • 火币 (1)
  • 灰度 (1)
  • 特朗普 (5)
  • 社交 (2)
  • 稳定币 (3)
  • 空投 (6)
  • 纳斯达克 (1)
  • 美国 (6)
  • 美国证券交易委员会 (3)
  • 英伟达 (2)
  • 英国 (1)
  • 萨尔瓦多 (1)
  • 融资 (3)
  • 行情异动 (7)
  • 贝莱德 (1)
  • 质押 (4)
  • 赵长鹏 (1)
  • 跨链 (3)
  • 跨链桥 (1)
  • 迪拜 (1)
  • 重要消息 (45)
  • 金库 (1)
  • 钱包 (4)
  • 阿根廷 (1)
  • 阿里云 (1)
  • 隐私 (2)
  • 项目重要进展 (9)
  • Bluesky
  • Mail
©2026 WT快讯 | Design: Newspaperly WordPress Theme
%d