International Conference on Machine Learning 2024

The 41st International Conference on Machine Learning (ICML 2024) was held in Vienna, Austria, from July 21 to 27, 2024. The list of accepted papers offers a view of the latest developments and frontier directions in machine learning research.

The papers are listed below (Chinese translations by ChatGPT 4o):

Perturb-and-Project: Differentially Private Similarities and Marginals
扰动与投影:差分隐私相似性和边缘分布
作者:Vincent Cohen-Addad, Tommaso d’Orsi, Alessandro Epasto, Vahab Mirrokni, Peilin Zhong

Replicable Learning of Large-Margin Halfspaces
大边距半空间的可复制学习
作者:Alkis Kalavasis, Amin Karbasi, Kasper Green Larsen, Grigoris Velegkas, Felix Zhou

Decoding-time Realignment of Language Models
语言模型解码时的重新对齐
作者:Tianlin Liu, Shangmin Guo, Leonardo Bianco*, Daniele Calandriello, Quentin Berthet, Felipe Llinares-López, Jessica Hoffmann, Lucas Dixon, Michal Valko, Mathieu Blondel

Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
目标网络和过参数化可稳定带函数逼近的异策略自举
作者:Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A Ramirez*, Christopher K Harris*, A. Rupam Mahmood, Dale Schuurmans

Dynamic Correlation Clustering in Sublinear Update Time
次线性更新时间的动态相关聚类
作者:Vincent Cohen-Addad, Silvio Lattanzi, Andreas Maggiori, Nikos Parotsidis

PriorBoost: An Adaptive Algorithm for Learning from Aggregate Responses
PriorBoost:一种自适应算法,用于从汇总响应中学习
作者:Adel Javanmard, Matthew Fahrbach, Vahab Mirrokni

How Free is Parameter-Free Stochastic Optimization?
无参数随机优化有多自由?
作者:Amit Attia, Tomer Koren

Practical Performance Guarantees for Pipelined DNN Inference
流水线DNN推理的实用性能保证
作者:Aaron Archer, Matthew Fahrbach, Kuikui Liu, Prakash Prabhu

Regression with Multi-Expert Deferral
多专家推迟的回归
作者:Anqi Mao, Mehryar Mohri, Yutao Zhong

Data-Efficient Learning via Clustering-Based Sensitivity Sampling: Foundation Models and Beyond
通过基于聚类的敏感性采样实现数据高效学习:基础模型及其应用
作者:Kyriakos Axiotis, Vincent Cohen-Addad, Monika Henzinger, Sammy Jerome, Vahab Mirrokni, David Saulpic, David Woodruff, Michael Wunder

Isometric Representation Learning for Disentangled Latent Space of Diffusion Models
面向扩散模型解耦潜在空间的等距表示学习
作者:Jaehoon Hahm, Junho Lee, Sunghyun Kim, Joonseok Lee

Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs
从学生中学习:应用t分布探索准确和高效的LLM格式
作者:Jordan Dotzel, Yuzong Chen, Bahaa Kotb, Sushma Prasad, Gang Wu, Sheng Li, Mohamed S. Abdelfattah, Zhiru Zhang

LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views
LEVI:通过不同视角的分层集成实现可泛化微调
作者:Yuji Roh, Qingyun Liu, Huan Gui, Zhe Yuan, Yujin Tang, Steven Euijong Whang, Liang Liu, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao

Out of the Ordinary: Spectrally Adapting Regression for Covariate Shift
不同寻常:针对协变量偏移的回归谱自适应
作者:Benjamin Eyre, Elliot Creager, David Madras, Vardan Papyan, Richard Zemel

Privacy-Preserving Instructions for Aligning Large Language Models
用于对齐大型语言模型的隐私保护指令
作者:Da Yu*, Peter Kairouz, Sewoong Oh, Zheng Xu

Representation Surgery: Theory and Practice of Affine Steering
表示手术:仿射转向的理论与实践
作者:Shashwat Singh, Shauli Ravfogel*, Jonathan Herzig, Roee Aharoni, Ryan Cotterell, Ponnurangam Kumaraguru

A Statistical Framework for Data-dependent Retrieval-Augmented Models
数据依赖的检索增强模型的统计框架
作者:Soumya Basu, Ankit Singh Rawat, Manzil Zaheer

Two Heads are Actually Better than One: Towards Better Adversarial Robustness via Transduction and Rejection
两个头确实比一个好:通过转导和拒绝实现更好的对抗鲁棒性
作者:Nils Palumbo, Yang Guo, Xi Wu, Jiefeng Chen, Yingyu Liang, Somesh Jha

Bayesian Regret Minimization in Offline Bandits
离线赌博机中的贝叶斯遗憾最小化
作者:Marek Petrik, Guy Tennenholtz, Mohammad Ghavamzadeh

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
使用前瞻解码打破LLM推理的顺序依赖
作者:Yichao Fu, Peter Bailis, Ion Stoica, Hao Zhang

Do Large Code Models Understand Programming Concepts? Counterfactual Analysis for Code Predicates
大型代码模型能理解编程概念吗?代码谓词的反事实分析
作者:Ashish Hooda*, Mihai Christodorescu, Miltiadis Allamanis, Aaron Wilson, Kassem Fawaz, Somesh Jha

DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems
DySLIM:通过不变测度实现混沌系统的动态稳定学习
作者:Yair Schiff, Zhong Yi Wan, Jeffrey B. Parker, Stephan Hoyer, Volodymyr Kuleshov, Fei Sha, Leonardo Zepeda-Núñez

A Field Guide for Pacing Budget and ROS Constraints
预算与ROS约束下节奏控制的实用指南
作者:Santiago R. Balseiro, Kshipra Bhawalkar, Zhe Feng, Haihao Lu, Vahab Mirrokni, Balasubramanian Sivan, Di Wang

How Private is DP-SGD?
DP-SGD有多私密?
作者:Lynn Chua, Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang

Improved Differentially Private and Lazy Online Convex Optimization: Lower Regret without Smoothness Requirements
改进的差分隐私和惰性在线凸优化:无需光滑性要求的更低遗憾
作者:Naman Agarwal, Satyen Kale, Karan Singh, Abhradeep Guha Thakurta

LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging
LayerMerge:通过层剪枝和合并实现神经网络深度压缩
作者:Jinuk Kim, Marwa El Halabi, Mingi Ji, Hyun Oh Song

Learning and Forgetting Unsafe Examples in Large Language Models
大型语言模型对不安全示例的学习与遗忘
作者:Jiachen Zhao, Zhun Deng, David Madras, James Zou, Mengye Ren

A Near-Linear Time Approximation Algorithm for Beyond-Worst-Case Graph Clustering
超越最坏情况图聚类的近线性时间近似算法
作者:Vincent Cohen-Addad, Tommaso d’Orsi, Aida Mousavifar

The Non-linear F-Design and Applications to Interactive Learning
非线性F设计及其在交互学习中的应用
作者:Alekh Agarwal, Jian Qian, Alexander Rakhlin, Tong Zhang

Pi-DUAL: Using Privileged Information to Distinguish Clean from Noisy Labels
Pi-DUAL:利用特权信息区分干净标签和噪声标签
作者:Ke Wang, Guillermo Ortiz-Jimenez, Rodolphe Jenatton, Mark Collier, Efi Kokiopoulou, Pascal Frossard

Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities
观点:破解针对边缘化社区的级联差距的密码
作者:Golnoosh Farnadi, Mohammad Havaei, Negar Rostamzadeh

Unmasking Vulnerabilities: Cardinality Sketches Under Adaptive Inputs
揭示漏洞:自适应输入下的基数草图
作者:Sara Ahmadian, Edith Cohen

What is Dataset Distillation Learning?
什么是数据集蒸馏学习?
作者:William Yang, Ye Zhu, Zhiwei Deng, Olga Russakovsky

Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?
循环Transformer能否学会为上下文学习实现多步梯度下降?
作者:Khashayar Gatmiry, Nikunj Saunshi, Sashank J. Reddi, Stefanie Jegelka, Sanjiv Kumar

Cell2Sentence: Teaching Large Language Models the Language of Biology
Cell2Sentence:教大型语言模型生物学语言
作者:Daniel Levine, Syed A Rizvi, Sacha Lévy, Nazreen Pallikkavaliyaveetil, David Zhang, Xingyu Chen, Sina Ghadermarzi, Ruiming Wu, Zihe Zheng, Ivan Vrkic, Anna Zhong, Daphne Raskin, Insu Han, Antonio Henrique de Oliveira Fonseca, Josue Ortega Caro, Amin Karbasi, Rahul Madhav Dhodapkar, David van Dijk

Consistent Submodular Maximization
一致性子模最大化
作者:Paul Duetting, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard, Morteza Zaddimoghadam

Controlled Decoding from Language Models
语言模型的受控解码
作者:Sidharth Mudgal, Jong Lee, Harish Ganapathy, YaGuang Li, Tao Wang*, Yanping Huang, Zhifeng Chen, Heng-Tze Cheng, Michael Collins, Trevor Strohman, Jilin Chen, Alex Beutel*, Ahmad Beirami

Differentially Private Domain Adaptation with Theoretical Guarantees
具有理论保证的差分隐私领域适应
作者:Raef Bassily, Corinna Cortes, Anqi Mao, Mehryar Mohri

Eluder-Based Regret for Stochastic Contextual MDPs
随机上下文MDP的基于Eluder的遗憾
作者:Orin Levy, Asaf Cassel, Alon Cohen, Yishay Mansour

A Minimaximalist Approach to Reinforcement Learning from Human Feedback
基于人类反馈的强化学习的极小极大主义方法
作者:Gokul Swamy*, Christoph Dann, Rahul Kidambi, Zhiwei Steven Wu, Alekh Agarwal

Multi-View Stochastic Block Models
多视角随机块模型
作者:Vincent Cohen-Addad, Tommaso d’Orsi, Silvio Lattanzi, Rajai Nasser

Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
聚合赌博机反馈下线性MDP的近最优遗憾
作者:Asaf Cassel, Haipeng Luo, Aviv Rosenberg, Dmitry Sotnikov

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
Patchscopes:检查语言模型隐藏表示的统一框架
作者:Asma Ghandeharioun, Avi Caciularu, Adam Pearce, Lucas Dixon, Mor Geva

Robust Inverse Graphics via Probabilistic Inference
通过概率推理实现鲁棒逆向图形学
作者:Tuan Anh Le, Pavel Sountsov, Matthew Douglas Hoffman, Ben Lee, Brian Patton, Rif A. Saurous

Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation
分数恒等式蒸馏:面向一步生成的预训练扩散模型指数级快速蒸馏
作者:Mingyuan Zhou, Huangjie Zheng, Zhendong Wang, Mingzhang Yin, Hai Huang

Tandem Transformers for Inference Efficient LLMs
面向推理高效LLM的串联Transformer
作者:Aishwarya P S, Pranav Ajit Nair, Yashas Samaga B L, Toby James Boyd, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli

Transforming and Combining Rewards for Aligning Large Language Models
变换和组合奖励以对齐大型语言模型
作者:Zihao Wang, Chirag Nagpal, Jonathan Berant, Jacob Eisenstein, Alexander D’Amour, Sanmi Koyejo, Victor Veitch

USTAD: Unified Single-Model Training Achieving Diverse Scores for Information Retrieval
USTAD:实现信息检索多样化得分的统一单模型训练
作者:Seungyeon Kim, Ankit Singh Rawat, Manzil Zaheer, Wittawat Jitkrittum, Veeranjaneyulu Sadhanala, Sadeep Jayasumana, Aditya Krishna Menon, Rob Fergus, Sanjiv Kumar

Adaptive Accompaniment with ReaLchords
使用ReaLchords进行自适应伴奏
作者:Yusong Wu, Tim Cooijmans, Kyle Kastner, Adam Roberts, Ian Simon, Alexander Scarlatos, Chris Donahue, Cassie Tarakajian, Shayegan Omidshafiei*, Aaron Courville, Pablo Samuel Castro, Natasha Jaques, Cheng-Zhi Anna Huang

A Decoder-Only Foundation Model for Time-Series Forecasting
用于时间序列预测的仅解码器基础模型
作者:Abhimanyu Das, Weihao Kong, Rajat Sen, Yichen Zhou

Deep Fusion: Efficient Network Training via Pre-trained Initializations
深度融合:通过预训练初始化实现高效网络训练
作者:Hanna Mazzawi, Javier Gonzalvo, Michael Wunder, Sammy Jerome, Benoit Dherin

Extracting Training Data from Document-Based VQA Models
从基于文档的视觉问答模型中提取训练数据
作者:Francesco Pinto, Nathalie Rauschmayr, Florian Tramer, Philip Torr, Federico Tombari

FrameQuant: Flexible Low-Bit Quantization for Transformers
FrameQuant:用于Transformer的灵活低比特量化
作者:Harshavardhan Adepu, Zhanpeng Zeng, Li Zhang, Vikas Singh

H-Consistency Guarantees for Regression
回归的H-一致性保证
作者:Anqi Mao, Mehryar Mohri, Yutao Zhong

Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States
线性二次控制中策略梯度的隐式偏差:向未见初始状态的外推
作者:Noam Razin, Yotam Alexander, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen

Interpretability Illusions in the Generalization of Simplified Models
简化模型泛化中的可解释性幻觉
作者:Dan Friedman*, Andrew Kyle Lampinen, Lucas Dixon, Danqi Chen, Asma Ghandeharioun

Large Language Models Can Automatically Engineer Features for Few-Shot Tabular Learning
大型语言模型可以自动设计特征以进行少样本表格学习
作者:Sungwon Han*, Jinsung Yoon, Sercan O Arik, Tomas Pfister

MC-GTA: Metric-Constrained Model-Based Clustering Using Goodness-of-Fit Tests with Autocorrelations
MC-GTA:使用带自相关的拟合优度检验的度量约束模型聚类
作者:Zhangyu Wang, Gengchen Mai, Krzysztof Janowicz, Ni Lao

Mean Estimation in the Add-Remove Model of Differential Privacy
差分隐私添加-移除模型中的均值估计
作者:Alex Kulesza, Ananda Suresh, Yuyan Wang

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
分布化的更多好处:强化学习的二阶界限
作者:Kaiwen Wang, Owen Oertell, Alekh Agarwal, Nathan Kallus, Wen Sun

Online Learning with Bounded Recall
有界回忆的在线学习
作者:Jon Schneider, Kiran Vodrahalli

Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity
离群值加权逐层稀疏性(OWL):将LLM剪枝至高稀疏度所缺失的秘诀
作者:Lu Yin, You Wu, Zhenyu Zhang, Cheng-Yu Hsieh, Yaqing Wang, Yiling Jia, Gen Li, Ajay Kumar Jaiswal, Mykola Pechenizkiy, Yi Liang, Michael Bendersky, Zhangyang Wang, Shiwei Liu

Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines
生成式掩码语言建模的前景与陷阱:理论框架和实用指南
作者:Yuchen Li, Alexandre Kirchmeyer, Aashay Mehta, Yilong Qin, Boris Dadachev, Kishore Papineni, Sanjiv Kumar, Andrej Risteski

SCoRe: Submodular Combinatorial Representation Learning
SCoRe:子模组合表示学习
作者:Anay Majee, Suraj Kothawade, Krishnateja Killamsetty, Rishabh K Iyer

Simplicity Bias via Global Convergence of Sharpness Minimization
通过锐度最小化的全局收敛实现简单性偏置
作者:Khashayar Gatmiry, Zhiyuan Li, Sashank J. Reddi, Stefanie Jegelka

Auto-Linear Phenomenon in Subsurface Imaging
地下成像中的自动线性现象
作者:Yinan Feng, Yinpeng Chen, Peng Jin, Shihang Feng, Youzuo Lin

FRAPPÉ: A Group Fairness Framework for Post-Processing Everything
FRAPPÉ:一个用于后处理所有内容的群体公平框架
作者:Alexandru Tifrea*, Preethi Lahoti, Ben Packer, Yoni Halpern, Ahmad Beirami, Flavien Prost

Individualized Privacy Accounting via Subsampling with Applications in Combinatorial Optimization
通过子采样实现的个性化隐私计量及其在组合优化中的应用
作者:Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Adam Sealfon

Online Speculative Decoding
在线投机解码
作者:Xiaoxuan Liu, Lanxiang Hu, Peter Bailis, Alvin Cheung, Zhijie Deng, Ion Stoica, Hao Zhang

The Pitfalls of Next-Token Prediction
下一词元预测的陷阱
作者:Gregor Bachmann, Vaishnavh Nagarajan

PolySketchFormer: Fast Transformers via Sketching Polynomial Kernels
PolySketchFormer:通过多项式核草图化实现快速Transformer
作者:Praneeth Kacham, Vahab Mirrokni, Peilin Zhong

Position: Social Environment Design Should be Further Developed for AI-based Policy-Making
观点:社会环境设计应进一步发展以支持基于AI的政策制定
作者:Edwin Zhang, Sadie Zhao, Tonghan Wang, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, Yiling Chen

Prompt-Tuning Latent Diffusion Models for Inverse Problems
面向逆问题的潜在扩散模型提示调优
作者:Hyungjin Chung, Jong Chul Ye, Peyman Milanfar, Mauricio Delbracio

VideoPrism: A Foundational Visual Encoder for Video Understanding
VideoPrism:一个用于视频理解的基础视觉编码器
作者:Long Zhao, Nitesh Bharadwaj Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao*, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong

RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
RLAIF vs. RLHF:通过AI反馈扩展人类反馈强化学习
作者:Harrison Lee, Samrat Phatale, Hassan Mansoor, Thomas Mesnard, Johan Ferret, Kellie Ren Lu, Colton Bishop, Ethan Hall, Victor Carbune, Abhinav Rastogi, Sushant Prakash

From Self-Attention to Markov Models: Unveiling the Dynamics of Generative Transformers
从自注意力到马尔可夫模型:揭示生成式Transformer的动态
作者:Muhammed Emrullah Ildiz, Yixiao Huang, Yingcong Li, Ankit Singh Rawat, Samet Oymak

Generalized Neural Collapse for a Large Number of Classes
大量类别的广义神经坍塌
作者:Jiachen Jiang, Jinxin Zhou, Peng Wang, Qing Qu, Dustin G. Mixon, Chong You, Zhihui Zhu

High-Dimensional Geometric Streaming for Nearly Low Rank Data
近低秩数据的高维几何流
作者:Hossein Esfandiari, Praneeth Kacham, Vahab Mirrokni, David Woodruff, Peilin Zhong

Improved Communication-Privacy Trade-Offs in L2 Mean Estimation Under Streaming Differential Privacy
在流式差分隐私下L2均值估计中改进的通信-隐私权衡
作者:Wei-Ning Chen, Berivan Isik, Peter Kairouz, Albert No, Sewoong Oh, Zheng Xu

On Discrete Prompt Optimization for Diffusion Models
关于扩散模型的离散提示优化
作者:Ruochen Wang, Ting Liu, Cho-Jui Hsieh, Boqing Gong

OSSCAR: One-Shot Structured Pruning in Vision and Language Models with Combinatorial Optimization
OSSCAR:通过组合优化在视觉和语言模型中的一次性结构化剪枝
作者:Xiang Meng, Shibal Ibrahim, Kayhan Behdin, Hussein Hazimeh, Natalia Ponomareva, Rahul Mazumder

Weisfeiler-Leman at the Margin: When More Expressivity Matters
Weisfeiler-Leman在边缘:当更多表现力很重要时
作者:Billy Joe Franks, Christopher Morris, Ameya Velingker, Floris Geerts
