Review

[논문리뷰] SAM 3: Segment Anything with Concepts

이 [arXiv]에 게시한 'SAM 3: Segment Anything with Concepts' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] RynnVLA-002: A Unified Vision-Language-Action and World Model

이 [arXiv]에 게시한 'RynnVLA-002: A Unified Vision-Language-Action and World Model' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Rethinking Saliency Maps: A Cognitive Human Aligned Taxonomy and Evaluation Framework for Explanations

Noam Koenigstein이 [arXiv]에 게시한 'Rethinking Saliency Maps: A Cognitive Human Aligned Taxonomy and Evaluation Framework for Explanations' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Planning with Sketch-Guided Verification for Physics-Aware Video Generation

Shayegan Omidshafiei이 [arXiv]에 게시한 'Planning with Sketch-Guided Verification for Physics-Aware Video Generation' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs

이 [arXiv]에 게시한 'Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

이 [arXiv]에 게시한 'OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] OmniScientist: Toward a Co-evolving Ecosystem of Human and AI Scientists

Weiquan Lin이 [arXiv]에 게시한 'OmniScientist: Toward a Co-evolving Ecosystem of Human and AI Scientists' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents

이 [arXiv]에 게시한 'O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Multi-Faceted Attack: Exposing Cross-Model Vulnerabilities in Defense-Equipped Vision-Language Models

이 [arXiv]에 게시한 'Multi-Faceted Attack: Exposing Cross-Model Vulnerabilities in Defense-Equipped Vision-Language Models' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging

이 [arXiv]에 게시한 'MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight

이 [arXiv]에 게시한 'Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Loomis Painter: Reconstructing the Painting Process

이 [arXiv]에 게시한 'Loomis Painter: Reconstructing the Painting Process' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Insights from the ICLR Peer Review and Rebuttal Process

Nedjma Ousidhoum이 [arXiv]에 게시한 'Insights from the ICLR Peer Review and Rebuttal Process' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

이 [arXiv]에 게시한 'GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models

Serena Yeung-Levy이 [arXiv]에 게시한 'Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Diversity Has Always Been There in Your Visual Autoregressive Models

Yaxing Wang이 [arXiv]에 게시한 'Diversity Has Always Been There in Your Visual Autoregressive Models' 논문에 대한 자세한 리뷰입니다.

2025년 11월 24일

[논문리뷰] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

이 [arXiv]에 게시한 'Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO' 논문에 대한 자세한 리뷰입니다.

2025년 11월 21일

[논문리뷰] V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models

Baijiong Lin이 [arXiv]에 게시한 'V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models' 논문에 대한 자세한 리뷰입니다.

2025년 11월 21일

[논문리뷰] TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval

이 [arXiv]에 게시한 'TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval' 논문에 대한 자세한 리뷰입니다.

2025년 11월 21일

[논문리뷰] TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding

이 [arXiv]에 게시한 'TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding' 논문에 대한 자세한 리뷰입니다.

2025년 11월 21일