成为VIP会员查看完整内容
VIP会员码认证
首页
主题
会员
服务
注册
·
登录
模态
关注
4
综合
百科
VIP
热门
动态
论文
精华
FCMBench: A Comprehensive Financial Credit Multimodal Benchmark for Real-world Applications
Arxiv
0+阅读 · 1月6日
Multi-Modal Data-Enhanced Foundation Models for Prediction and Control in Wireless Networks: A Survey
Arxiv
0+阅读 · 1月6日
RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence
Arxiv
0+阅读 · 1月6日
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision
Arxiv
0+阅读 · 1月6日
Multimodal oscillator networks learn to solve a classification problem
Arxiv
0+阅读 · 1月6日
MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection
Arxiv
0+阅读 · 1月6日
Musical Score Understanding Benchmark: Evaluating Large Language Models' Comprehension of Complete Musical Scores
Arxiv
0+阅读 · 1月6日
Protecting multimodal large language models against misleading visualizations
Arxiv
0+阅读 · 1月6日
Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval
Arxiv
0+阅读 · 1月6日
Omni2Sound: Towards Unified Video-Text-to-Audio Generation
Arxiv
0+阅读 · 1月6日
Scene-Aware Vectorized Memory Multi-Agent Framework with Cross-Modal Differentiated Quantization VLMs for Visually Impaired Assistance
Arxiv
0+阅读 · 1月6日
Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval
Arxiv
0+阅读 · 1月6日
LTX-2: Efficient Joint Audio-Visual Foundation Model
Arxiv
0+阅读 · 1月6日
Focus on What Matters: Fisher-Guided Adaptive Multimodal Fusion for Vulnerability Detection
Arxiv
0+阅读 · 1月5日
Advancing Assistive Robotics: Multi-Modal Navigation and Biophysical Monitoring for Next-Generation Wheelchairs
Arxiv
0+阅读 · 1月6日
参考链接
提示
微信扫码
咨询专知VIP会员与技术项目合作
(加微信请备注: "专知")
微信扫码咨询专知VIP会员
Top