Encoder Decoder Model

知乎 on MSN

谷歌 Gemini 准确率从 21% 提升至 97% 仅靠「复制粘贴」，这意味着什么?

我了个手动注意力机制，人类的本质是复读机。重要的话说三遍，复读 is all u need！重要的话说三遍，复读 is all u need！重要的话说三遍，复读 is all u need！仔细推导了一下，其实原版 Attention 机制是不会出现这种问题的。这个其实是 Causal LM 才会有的问题，这个技巧本质上是在用 Causal LM ...

CNX Software

AOMedia AV2 video codec draft specification release, and a quick try at the reference ...

After 5 years of work and over 2700 commits against the reference software, the Alliance for Open Media (AOMedia) has ...

Digit on MSN

GLM-image explained: Huawei-powered AI that seriously challenges Nvidia, here’s how

For the past few years, a single axiom has ruled the generative AI industry: if you want to build a state-of-the-art model, ...

腾讯网

覆盖天体物理/地球科学/流变学/声学等19种场景，Polymathic AI构建1.3B ...

作者：梅菜编辑：李宝珠转载请联系本公众号获得授权，并标明来源Polymathic AI 联合研究团队提出了一个以 Transformer 为核心架构、主要面向类流体连续介质动力学的基础模型 Walrus。Walrus 在预训练阶段覆盖了 19 ...

腾讯网

轻舟的VLA与世界模型架构解读

这张架构图展示的是轻舟智航下一代自动驾驶模型架构，核心理念是将 VLA（Vision-Language-Action，视觉-语言-动作模型）与 World Model（世界模型）融合到一个端到端（End-to-End）的系统中。

16 天

OCR迎来“闪电时刻”：LightOnOCR-2以1B模型击败9B竞品，开源即达SOTA！

最近， LightOn 在文档理解领域推出了名为 LightOnOCR-2-1B 的全新模型。这个模型仅用10亿的参数量，就在权威的 OCR 评测基准 OlmOCR-Bench 上取得了当前最佳成绩（SOTA），把一众参数量大它9倍的巨无霸模型甩在了身后。

The Tech Report

Techreport: Your Frontline Source for Tech News

Search Interest in Psiphon and Viva VPN is surging in Iran alongside terms like “VPN” and “filter breaker”, indicating people from the region are looking… ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果