站内搜索

大语言模型LLM

STAR（Synthesis of Tailored Architectures）：深度学习的合成定制化架构

发表评论 / Tech / NullThought

论文STAR: Synthesis of Tailored Architectures（《STAR: 合成定制 […]

STAR（Synthesis of Tailored Architectures）：深度学习的合成定制化架构 Read More »

多种大语言模型（LLMs）在磁共振成像（MRI）技术问题回答中的表现

发表评论 / Tech / NullThought

论文Performance of Large Language Models in Technical MRI

多种大语言模型（LLMs）在磁共振成像（MRI）技术问题回答中的表现 Read More »

大语言模型嵌入（embedding）的回归特征分析

发表评论 / Tech, 科学 / NullThought

论文Understanding LLM Embeddings for Regression深入研究了大语言模型

大语言模型嵌入（embedding）的回归特征分析 Read More »

Star Attention算法：有效提升大型语言模型（LLM）在长序列推理任务中的效率

发表评论 / Tech / NullThought

论文《Star Attention: Efficient LLM Inference over Long Se

Star Attention算法：有效提升大型语言模型（LLM）在长序列推理任务中的效率 Read More »

Re-Invoke：完全无监督的大模型调用工具的检索方法

发表评论 / Tech / NullThought

论文Re-Invoke: Tool Invocation Rewriting for Zero-Shot To

Re-Invoke：完全无监督的大模型调用工具的检索方法 Read More »

Hymba：用于小型语言模型的混合头架构（Hybrid-head Architecture）

发表评论 / Tech / NullThought

论文《Hymba: A Hybrid-head Architecture for Small Language

Hymba：用于小型语言模型的混合头架构（Hybrid-head Architecture） Read More »