STAR (Synthesis of Tailored Architectures): Synthesizing Tailored Architectures for Deep Learning
The paper STAR: Synthesis of Tailored Architectures […]
The paper Performance of Large Language Models in Technical MRI […]
Performance of Multiple Large Language Models (LLMs) on Technical Magnetic Resonance Imaging (MRI) Question Answering
The paper Understanding LLM Embeddings for Regression takes an in-depth look at large language models […]
An Analysis of Large Language Model (LLM) Embeddings as Regression Features
The paper Star Attention: Efficient LLM Inference over Long Se […]
The Star Attention Algorithm: Improving Large Language Model (LLM) Efficiency on Long-Sequence Inference Tasks
The paper Re-Invoke: Tool Invocation Rewriting for Zero-Shot To […]
Re-Invoke: A Fully Unsupervised Retrieval Method for Tool Invocation by Large Models
The paper Hymba: A Hybrid-head Architecture for Small Language […]
Hymba: A Hybrid-Head Architecture for Small Language Models