B’MOJO新架构:通过动态整合形象记忆与渐变记忆,克服现有模型在记忆管理和长序列建模上的局限性
论文B’MOJO: Hybrid State Space Realizations of Foun...
Read MoreChatQA模型,在检索增强生成(RAG)和对话式问答(Conversational QA)任务中表现超越GPT-4?
论文ChatQA: Surpassing GPT-4 on Conversational QA and RAG...
Read MoreGoogle Deepmind报告:把控赋予科学发现全新机遇的人工智能,迎接科学发现黄金时代
Google Deepmind近日发布报告A new golden age of discovery̵...
Read More通过图算法(Graph Algorithms)研究和理解Transformer的推理能力
论文Understanding Transformer Reasoning Capabilities via...
Read MoreCodomain Attention Neural Operator (CoDA-NO):引入值域注意力机制的神经算子
论文Pretraining Codomain Attention Neural Operators for S...
Read More利用大语言模型(LLMs)进行数据挖掘,从而学习肽自组装(peptide self-assembly)的规则
论文Learning the rules of peptide self-assembly through d...
Read More医疗领域基于大语言模型(LLM)的智能代理系统(agentic systems)
论文LLM-based agentic systems in medicine and healthcare探...
Read MorePaliGemma 2: 用于迁移学习的多功能视觉-语言模型(VLM)家族
论文PaliGemma 2: A Family of Versatile VLMs for Transfer(...
Read More视觉自回归建模(Visual AutoRegressive, VAR)
论文Visual Autoregressive Modeling: Scalable Image Genera...
Read More结合卷积神经网络(CNN)的特征提取能力和物理信息神经网络(PINNs)的物理规律建模能力,实现对液滴撞击和两相界面演变的三维高精度重建
论文PINNs4Drops: Convolutional feature-enhanced physics-i...
Read MoreGenie 2:大型基础世界模型(large-scale foundation world model)
Google DeepMind刚推出了Genie 2。Genie 2是一种基础世界模型,能够生成无限多样的、可...
Read MoreVLsI模型:逐层蒸馏,逐层对齐,实现从大规模到小规模视觉-语言模型(VLM)的高效知识迁移
论文《VLsI: Verbalized Layers-to-Interactions from Large t...
Read More通过多目标优化(multi-objective optimization)自动发现最佳元解算器(meta-solvers)
论文Automatic discovery of optimal meta-solvers via multi...
Read MoreSTAR(Synthesis of Tailored Architectures):深度学习的合成定制化架构
论文STAR: Synthesis of Tailored Architectures(《STAR: 合成定制...
Read More库珀站(Cooper Station)和斯坦福环(Stanford Torus)
在Youtube上看电影《星际穿越》(Interstellar)。电影临近结尾,主人公库珀(Cooper)被救...
Read MoreModel Context Protocol(MCP):模型上下文协议
近日,Anthropic公司发布了“模型上下文协议(Model Context Protocol,MCP)”。...
Read More多种大语言模型(LLMs)在磁共振成像(MRI)技术问题回答中的表现
论文Performance of Large Language Models in Technical MRI...
Read MoreStar Attention算法:有效提升大型语言模型(LLM)在长序列推理任务中的效率
论文《Star Attention: Efficient LLM Inference over Long Se...
Read MoreHAI-DEF(Health AI Developer Foundations):健康AI开发基础模型集
论文《Health AI Developer Foundations》详细描述了由Google研究团队和Dee...
Read MoreNASA选择SpaceX的猎鹰重型火箭(Falcon Heavy)提供发射服务,用于执行“蜻蜓”(Dragonfly)任务
近日NASA宣布,将选择 SpaceX 提供发射服务,用于执行“蜻蜓”(Dragonfly)任务,这是 NAS...
Read More