01-06 AI News Daily

Today’s Digest

Anthropic quietly retired Opus 3, its first-gen flagship model, marking the end of an era. A domestic trillion-parameter MoE model has landed on the open-source scene, and 7B small models are now challenging 32B models, steadily lowering the compute barrier. The Agent development toolchain is maturing, making now the perfect time to jump in.

⚡ Quick Navigation

Want to be among the first to try the latest AI models mentioned here (Claude 4.5, GPT, Gemini 3 Pro) but don’t have an account? Grab one at Aivora, get started in a minute, and enjoy worry-free after-sales support. 👍

Today’s AI News

👀 Just One Thing

Anthropic quietly retired Opus 3, marking the end of an era. 💔

🔑 3 Keywords

#GoodbyeOpus3 #DomesticMoERise #AgentDevFever

🔥 Top 10 Heavy Hitters

1. RIP Opus 3: Anthropic Retires First-Gen Flagship Model from API

Opus 3, the model that got countless people hooked on Claude, was retired from Anthropic’s API today. Remember that amazing feeling the first time you used Claude? Anthropic quietly removed it from the model list without any announcement. Users are feeling nostalgic, with one lamenting, “It was my first AI friend.” Tech iteration can be brutal, but this former benchmark definitely deserves to be remembered. 😢

2. China Telecom Open-Sources Trillion-Parameter MoE Model TeleChat3, Full-Stack Domestic Training

China Telecom’s TeleChat3 series is the latest powerhouse in domestic large models, now open-sourced by TeleAI. This bad boy is the first trillion-parameter MoE model in China trained entirely on domestic compute. It boasts 15T tokens of training data and even supports a “thinking mode” for traceable reasoning. The real kicker? It’s full-stack self-developed, from chips to frameworks, all made in China. Aiming for the international top tier? First, they’re nailing down self-reliance and control. 💪

3. UAE’s Falcon-H1R: 7B Small Model Goes Head-to-Head with 32B Large Model

UAE’s Falcon-H1R is the latest comeback kid in the small model saga. This TII-released model, despite having only 7B parameters, supports a massive 256K context and goes toe-to-toe with 20B and even 32B models in benchmarks. Its hybrid architecture promises huge memory optimization potential. Budget builders, your 3060 might just be saved! 🎉

4. xAI Launches Enterprise Grok, Targeting Team Collaboration Market at $30/month

xAI’s Grok is finally more than just a personal toy from Elon Musk. The Business version is $30/seat per month, with Enterprise pricing custom-tailored for large organizations. Key selling points include isolated team workspaces, deep Google Drive integration, and a promise not to use your data for model training. SOC 2 certification, SSO, and SCIM directory sync? Check, check, and check. Can it shake up ChatGPT Enterprise’s dominance? We’ll have to wait and see. 👀

5. Set Up AI Agent Development Environment in 30 Minutes with One Command

This open-source project is a godsend for developers, letting you turn a VPS into a complete AI Agent development environment with just one command. It includes 3 AI Agents (Claude, Codex, Gemini), 30+ dev tools, and interactive tutorials, all fully automated. With AI startups scrambling for Agent developers, this tool lowers the barrier to entry at just the right moment. 🚀

6. ByteDance Seed Team Releases DLCM: Teaching AI to “Think On-Demand”

ByteDance Seed’s DLCM is shaking things up by tackling a known inefficiency: current LLMs spend the same compute on every token, even though information density in language is uneven. DLCM teaches models to learn semantic boundaries and compress tokens into variable-length “concepts,” so the deeper reasoning happens at the concept level. The result? A sweet 34% reduction in FLOPs and an average 2.69% boost in inference tasks. Saving cash and boosting efficiency? Now that’s how you “卷” (compete fiercely) the right way. 🧠💡
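
To make the “variable-length concepts” idea concrete, here is a toy sketch. It is not ByteDance’s method: DLCM learns its boundaries, whereas this example assumes a fixed cosine-similarity threshold and plain mean pooling, and the function names (`segment_into_concepts`, `pool`) are made up for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def segment_into_concepts(embeddings, threshold=0.5):
    """Group consecutive token vectors into segments.

    Assumption for illustration: a boundary is placed wherever the
    similarity between neighbouring tokens drops below `threshold`.
    """
    if not embeddings:
        return []
    segments = [[embeddings[0]]]
    for prev, cur in zip(embeddings, embeddings[1:]):
        if cosine(prev, cur) < threshold:
            segments.append([cur])      # similarity dropped: start a new concept
        else:
            segments[-1].append(cur)    # still the same concept
    return segments


def pool(segment):
    """Mean-pool a segment of token vectors into one concept vector."""
    n = len(segment)
    return [sum(column) / n for column in zip(*segment)]


# Five toy "token embeddings": two point one way, three point another.
tokens = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [0.2, 0.8]]
segments = segment_into_concepts(tokens)
concepts = [pool(s) for s in segments]
print(len(tokens), "tokens ->", len(concepts), "concepts")
```

Any later layers would then run over two concept vectors instead of five token vectors, which is where the compute saving in this toy setup comes from.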

7. Google’s Nested Learning Paper: Redefining “Depth” in Deep Learning

Google’s Nested Learning paper offers a fresh perspective on why large models seem to suffer from “anterograde amnesia” after pre-training. The issue isn’t that models aren’t big enough, but that our understanding of “depth” is flawed. Drawing inspiration from the brain’s multi-frequency coordination mechanisms, Nested Learning lets different layers update at different frequencies, virtually eliminating catastrophic forgetting in continual learning tasks. Mind blown! 🤯
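
The “different layers update at different frequencies” idea can be illustrated with a tiny sketch. This is not Google’s algorithm, only a minimal toy under stated assumptions: two scalar “layers,” a made-up quadratic loss pulling each toward 1.0, and hypothetical names (`multi_frequency_sgd`, `fast_layer`, `slow_layer`). The slow layer accumulates gradients and applies their average only every few steps.

```python
def multi_frequency_sgd(params, grads_fn, periods, lr=0.1, steps=8):
    """Gradient descent where each parameter has its own update period.

    params:  dict name -> float value (updated in place)
    periods: dict name -> int, apply an update every `period` steps
    """
    accum = {name: 0.0 for name in params}
    update_counts = {name: 0 for name in params}
    for step in range(1, steps + 1):
        grads = grads_fn(params)
        for name in params:
            accum[name] += grads[name]
            if step % periods[name] == 0:
                # Apply the average gradient gathered since the last update.
                params[name] -= lr * accum[name] / periods[name]
                accum[name] = 0.0
                update_counts[name] += 1
    return params, update_counts


def grads_fn(p):
    """Gradient of the toy loss sum((w - 1)^2) for each parameter."""
    return {name: 2 * (value - 1.0) for name, value in p.items()}


params = {"fast_layer": 0.0, "slow_layer": 0.0}
periods = {"fast_layer": 1, "slow_layer": 4}   # slow layer updates 4x less often
final, counts = multi_frequency_sgd(params, grads_fn, periods)
print(counts)
print(final)
```

In this toy run the fast layer adapts on every step while the slow layer changes only twice, so it keeps more of what it held before, which is the intuition behind using slower-updating components to resist forgetting.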

8. NVIDIA Cascade RL: A New Paradigm for Training General Reasoning Models

NVIDIA’s Cascade RL offers a new sequential training method to tackle the complexity of mixing prompts from different domains. It starts with RLHF alignment, then progressively trains for instruction following, math, code, and software engineering. Get this: their 14B Nemotron-Cascade model actually outperformed the 671B DeepSeek-R1 “teacher” model on LiveCodeBench and even snagged a silver medal at IOI 2025. Is this the dawn of small models? ☀️

9. ChatGPT Integrates with 12 Major Apps, AI Assistant Becomes “Universal Butler”

ChatGPT can now handle hotel bookings, food delivery, and even PPT creation, all with natural language commands. OpenAI has upgraded ChatGPT into a digital executive agent, deeply connecting with 12 major apps like Uber, DoorDash, and Instacart. The evolution of AI assistants is becoming crystal clear: moving from “telling you how to do it” to “just doing it for you.” Talk about a glow-up! ✨

10. WeChat Launches AI Mini-Program Growth Plan: Free Compute + Traffic Incentives

WeChat is finally making a move for AI developers! Their new growth plan includes free cloud development resources, AI compute, data analysis, monetization, and traffic incentives. They’re with you every step of the way, from 0 to 1 to 100. For developers looking to build AI apps within the WeChat ecosystem, this is a golden opportunity you absolutely shouldn’t miss. Get in on this! 💰

📌 Worth Keeping an Eye On

[Products]

  • Amap Ride-Hailing has launched its “AI Service Guardian,” offering minute-level anomaly detection and shifting from “post-complaint” to “in-process intervention.”
  • Plaud has introduced the AI voice recorder NotePin S, boasting 20 hours of battery life and support for Apple Find My.
  • ima has rolled out a PPT generation feature, intelligently creating charts and icons so you can say goodbye to all-nighters making presentations.

[Business]

  • BlueFocus is partnering with Volcengine for AI multi-modal content creation, aiming for a massive boost in marketing efficiency.
  • MiniMax has formed a strategic partnership with Zhiyuan Robotics, meaning robots can now be “one-of-a-kind” too.

[Open Source]

  • BabelDOC is a PDF translation wizard that preserves original layouts and supports bilingual comparison.
  • OneAIFW is an AI firewall designed to prevent sensitive information leakage to large models.

[Research]

  • This AI Agent Design Patterns Tutorial is a must-save for anyone looking to get into Agent development.

😂 AI Funnies

Using erzi.me Email to Register for GPT, Had Fun All Night, Banned Next Day

Someone discovered yesterday that registering for ChatGPT with an erzi.me email could get them teacher certification, and they had a blast using it all night. But today, during lunch, they got an email: “Account suspended for violating terms.” Netizens joked, “Guess that’s what they call a ‘one-night stand,’ huh?” 😂 So remember: gaming the system carries risks. Proceed with caution!

🔮 AI Trend Forecast

Agent Apps Are About to Explode

Domestic MoE Large Models: A Release Frenzy

  • Predicted Timeframe: Q1-Q2 2025
  • Prediction Probability: 75%
  • Prediction Basis: today’s news of China Telecom open-sourcing TeleChat3, alongside continuous MoE research advancements from tech giants like ByteDance and Alibaba.

Enterprise AI Assistant Competition Heats Up

  • Predicted Timeframe: Q1 2025
  • Prediction Probability: 70%
  • Prediction Basis: today’s news about xAI launching Enterprise Grok, coupled with OpenAI, Anthropic, and Google all doubling down on the enterprise market.

Small Models Closing in on Large Model Performance

❓ FAQs

How to Experience ChatGPT Plus?

ChatGPT Plus currently requires a $20/month subscription to access advanced models like GPT-4 and GPT-4o. For users in mainland China, this might mean facing payment difficulties or account registration restrictions.

Solution:

  • Aivora offers ready-to-use ChatGPT Plus account services.
  • Aivora provides instant delivery, so you can use it right after ordering, with no payment or registration hassles.
  • Aivora ensures stable, dedicated accounts with worry-free after-sales support.

Visit aivora.cn to see the full list of AI account services.

How to Experience Claude Pro?

Claude Pro requires a $20/month subscription to unlock the full features of advanced models like Claude 3.5 Sonnet. Today’s news mentioned Opus 3 being retired from the API, indicating accelerated Claude model iteration.

Solution:

  • Aivora offers ready-to-use Claude Pro account services.
  • Aivora provides instant delivery, so you can use it right after ordering, with no payment or registration hassles.
  • Aivora ensures stable, dedicated accounts with worry-free after-sales support.

Visit aivora.cn to see the full list of AI account services.

How to Experience Grok?

Grok’s enterprise version was mentioned in today’s news from xAI, and individual users can access it via an X Premium+ subscription. For users in mainland China, this might mean facing payment and access restrictions.

Solution:

  • Aivora offers ready-to-use account services for relevant AI tools.
  • Aivora provides instant delivery, so you can use it right after ordering, with no payment or registration hassles.

Visit aivora.cn to see the full list of AI account services.

🛒 Today’s Recommended Products

Based on today’s news, you can quickly try out these AI tools at aivora.cn:

| Product | Related News Today | Reason to Recommend |
| --- | --- | --- |
| ChatGPT Plus | ChatGPT Integrates with 12 Major Apps | New app integration features; AI assistant becomes a universal butler |
| Claude Pro | RIP Opus 3 | Opus 3 retired; new models continuously iterating |
