社区所有版块导航
Python
python开源   Django   Python   DjangoApp   pycharm  
DATA
docker   Elasticsearch  
aigc
aigc   chatgpt  
WEB开发
linux   MongoDB   Redis   DATABASE   NGINX   其他Web框架   web工具   zookeeper   tornado   NoSql   Bootstrap   js   peewee   Git   bottle   IE   MQ   Jquery  
机器学习
机器学习算法  
Python88.com
反馈   公告   社区推广  
产品
短视频  
印度
印度  
Py学习  »  aigc

CV&AIGC顶会整理 [2024-10-23]

晓飞的算法工程笔记 • 5 月前 • 239 次点击  

今日更新17篇:

  • 计算机视觉会议 5篇
  • 自然语言处理会议 12篇
请注意,大模型的论文多发布于自然语言处理会议中。而由于多模态的发展迅速,部分计算机视觉相关的论文也会发布在自然语言处理顶会中。

计算机视觉会议: 5篇


[0] Time-Resolved MNIST Dataset for Single-Photon Recognition[cs.CV]
标题:快速瞬态MNIST数据集用于单光子识别
作者:Aleksi Suonsivu, Lauri Salmela, Edoardo Peretti, Leevi Uosukainen, Radu Ciprian Bilcu, Giacomo Boracchi
链接:http://arxiv.org/abs/2410.16744
备注:12 pages, 4 figures. Accepted for Workshop on Synthetic Data for Computer Vision at ECCV 2024

[1] One-Step Diffusion Distillation through Score Implicit Matching[cs.CV]
标题:一步扩散蒸馏通过评分隐式匹配
作者:Weijian Luo, Zemin Huang, Zhengyang Geng, J. Zico Kolter, Guo-jun Qi
链接:http://arxiv.org/abs/2410.16794
期刊:NeurIPS 2024
备注:Accepted by NeurIPS 2024

[2] Hierarchical Clustering for Conditional Diffusion in Image Generation[cs.CV]
标题:图像生成中的条件扩散分层聚类
作者:Jorge da Silva Goncalves, Laura Manduchi, Moritz Vandenhirtz, Julia E. Vogt
链接:http://arxiv.org/abs/2410.16910
备注:25 pages, submitted to ICLR 2025

[3] LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization[cs.CV]
标题:从多个专家学习领域泛化的简单框架:LFME
作者:Liang Chen, Yong Zhang, Yibing Song, Zhiqiang Shen, Lingqiao Liu
链接:http://arxiv.org/abs/2410.17020
代码:https://github.com/liangchen527/LFME
备注:Accepted by NeurIPS 2024

[4] Masked Differential Privacy[cs.CV]
标题:掩码差分隐私
作者:David Schneider, Sina Sajadmanesh, Vikash Sehwag, Saquib Sarfraz, Rainer Stiefelhagen, Lingjuan Lyu, Vivek Sharma
链接:http://arxiv.org/abs/2410.17098
期刊:Proceedings of the 2nd International Workshop on Privacy-Preserving Computer Vision, ECCV 2024

自然语言处理会议: 12篇


[0] Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the U.S[cs.CL]
标题:斯苏盒或储蓄罐:评估加纳和美国之间的文化常识知识
作者:Christabel Acquaye, Haozhe An, Rachel Rudinger
链接:http://arxiv.org/abs/2410.16451
备注:Accepted to EMNLP 2024

[1] To the Globe (TTG): Towards Language-Driven Guaranteed Travel Planning[cs.CL]
标题:《走向世界(TTG):语言驱动确保旅行规划》
作者:Da JU, Song Jiang, Andrew Cohen, Aaron Foss, Sasha Mitts, Arman Zharmagambetov, Brandon Amos, Xian Li, Justine T Kao, Maryam Fazel-Zarandi, Yuandong Tian
链接:http://arxiv.org/abs/2410.16456
期刊:EMNLP 2024 Demo Track

[2] DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding[cs.CL]
标题:文档结构编辑:多模态大型语言模型锚定实现
作者:Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha
链接:http://arxiv.org/abs/2410.16472
备注:EMNLP 2024 (Main)

[3] Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method[cs.CL]
标题:正确解答后:基于后处理方法的跨段落问答增强
作者:Jiayi Lin, Chenyang Zhang, Haibo Tong, Dongyu Zhang, Qingqing Hong, Bingxuan Hou, Junli Wang
链接:http://arxiv.org/abs/2410.16788
备注:Accepted by EMNLP 2024 Findings

[4] Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning[cs.CL]
标题:可信的通过强化学习实现检索增强的大型语言模型的校准
作者:Zongmeng Zhang, Yufeng Shi, Jinhua Zhu, Wengang Zhou, Xiang Qi, Peng Zhang, Houqiang Li
链接:http://arxiv.org/abs/2410.16843
期刊:Proceedings of the 41st International Conference on Machine Learning, PMLR 235:59827-59850, 2024
备注:ICML 2024

[5] Learning Mathematical Rules with Large Language Models[cs.CL]
标题:利用大型语言模型学习数学规则
作者:Antoine Gorceix, Bastien Le Chenadec, Ahmad Rammal, Nelson Vadori, Manuela Veloso
链接:http://arxiv.org/abs/2410.16973
备注:4th MATH-AI Workshop at NeurIPS'24

[6] Data-driven Coreference-based Ontology Building[cs.CL]
标题:数据驱动核心提及基于本体构建
作者:Shir Ashury-Tahan, Amir David Nissan Cohen, Nadav Cohen, Yoram Louzoun, Yoav Goldberg
链接:http://arxiv.org/abs/2410.17051
期刊:EMNLP 2024

[7] Human-LLM Hybrid Text Answer Aggregation for Crowd Annotations[cs.CL]
标题:人类与大型语言模型混合文本答案汇聚用于众包标注
作者:Jiyi Li
链接:http://arxiv.org/abs/2410.17099
备注:Accepted in EMNLP 2024

[8] Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ?[cs.CL]
标题:可以通用的大型语言模型泛化到英语-泰语机器翻译吗?
作者:Jirat Chiaranaipanich, Naiyarat Hanmatheekuna, Jitkapat Sawatphol, Krittamate Tiankanon, Jiramet Kinchagawat, Amrest Chinkamol, Parinthapat Pengpun, Piyalitt Ittichaiwong, Peerat Limkonchotiwat
链接:http://arxiv.org/abs/2410.17145
备注:Accepted in GenBench EMNLP 2024

[9] Fine-Tuning Large Language Models to Appropriately Abstain with Semantic Entropy[cs.CL]
标题:微调大型语言模型以合理地基于语义熵避免特定内容
作者:Benedict Aaron Tjandra, Muhammed Razzak, Jannik Kossen, Kunal Handa, Yarin Gal
链接:http://arxiv.org/abs/2410.17234
备注:Accepted to NeurIPS Safe Generative AI Workshop 2024

[10] Towards Reliable Evaluation of Behavior Steering Interventions in LLMs[cs.CL]
标题:迈向可靠评估LLM中的行为引导干预措施
作者:Itamar Pres, Laura Ruis, Ekdeep Singh Lubana, David Krueger
链接:http://arxiv.org/abs/2410.17245
备注:Accepted to the NeurIPS 2024 - Workshop on Foundation Model Interventions

[11] Altogether: Image Captioning via Re-aligning Alt-text[cs.CV]
标题:整体:通过重新对齐Alt文本来进行图文描述
作者:Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer
链接:http://arxiv.org/abs/2410.17251
备注:accepted by EMNLP 2024; MetaCLIPv2

感谢arxiv.org


Python社区是高质量的Python/Django开发社区
本文地址:http://www.python88.com/topic/175248
 
239 次点击