#moe

4 posts

ai project Apr 28, 2026

Qwen (Tongyi Qianwen): Alibaba's Open-Source LLM Family, from 72B to 397B — A Complete Evolution Overview

Qwen (Tongyi Qianwen) is Alibaba's open-source LLM family, known for its Apache 2.0 license, 201-language coverage, and rapid iteration. The latest Qwen3.6 (2026/04) focuses on Agentic Coding — the 27B Dense version achieves 77.2% on SWE-bench and 59.3% on Terminal-Bench 2.0, on par with Claude Opus. A new Thinking Preservation feature lets agents retain reasoning context across turns.

#qwen #alibaba #llm #open-source #moe #multimodal #apache2 #ai-model #dashscope #on-device-ai #agentic-coding

ai project Mar 31, 2026

2026 Q1 Open-Source LLM Landscape: From Frontier Models to On-Device, a Complete Survey

2026 Q1 saw a full-blown open-source model explosion: on the LLM front, GLM-5, Kimi K2.5, and Qwen3.5 caught up with closed-source models; Embedding and Reranker are dominated by Qwen3 and BGE; speech has Voxtral TTS and Whisper V3; image has FLUX.2; and video has Wan 2.2 rivaling Sora. This is the complete navigation map.

#open-source #llm #glm-5 #kimi #deepseek #qwen #llama #gemma #mistral #minimax #phi #smollm #gpt-oss #moe #on-device-ai #embedding #reranker #tts #stt #image-generation #video-generation #code-model #ollama #vllm

ai project Mar 26, 2026

GLM-5: Zhipu AI's 744B Open-Source Model Trained Entirely on Huawei Chips

GLM-5 is a 744B MoE open-source model released by Zhipu AI (Z.ai) in February 2026, trained entirely on Huawei Ascend chips and released under the MIT license. It currently ranks as the top open-source model, surpassing Claude and GPT-5 on benchmarks like Humanity's Last Exam, while its API pricing is 1/5 to 1/8 of theirs.

#glm-5 #zhipu-ai #智譜ai #llm #moe #open-source #huawei-ascend #ai-model #agent

ai project Mar 26, 2026

Kimi: How Moonshot AI's Long-Context Model Challenges GPT and Claude

Kimi is a large language model from Chinese AI startup Moonshot AI, known for its ultra-long context window, open-source strategy, and highly competitive pricing. From 200K context in 2023 to K2.5 Agent Swarm in 2026, Kimi has become a force that the global AI market cannot ignore.

#kimi #moonshot-ai #llm #long-context #reasoning #月之暗面 #ai-model #moe #open-source