Visual Quality Assessment Competition (VQualA)

ICCV 2025

Topics

Visual quality assessment plays a crucial role in computer vision, especially in tasks like image quality assessment (IQA), image super-resolution, and document image enhancement. Traditional visual quality assessment techniques often rely on scalar metrics such as the peak signal-to-noise ratio (PSNR) and the structural similarity index measure (SSIM), which do not capture the perceptual quality of images or videos as experienced by humans. Since visual quality assessment has become more critical across fields such as medical imaging, satellite remote sensing, and document processing, there is a growing need for more comprehensive evaluation methods that consider human perception more accurately.

Additionally, recent advances in multimodal large language models (MLLMs) have expanded the potential for visual quality assessment by incorporating open-ended questions and natural language explanations, enabling a more nuanced understanding of visual quality. However, current methods are mainly focused on absolute quality ratings, which have inherent ambiguities. Therefore, there is an urgent need to explore visual quality comparison using MLLMs, particularly in the context of open-ended comparative assessments. This approach would enable a more reliable and consistent evaluation of visual quality, improving the alignment of computer vision models with human perceptual judgments.

Confirmed Speakers

Alan Bovik

Prof. Alan Bovik

Prof. Alan Bovik (HonFRPS) holds the Cockrell Family Endowed Regents Chair in Engineering in the Chandra Family Department of Electrical and Computer Engineering in the Cockrell School of Engineering at The University of Texas at Austin, where he is Director of the Laboratory for Image and Video Engineering (LIVE). He is a faculty member in the Department of Electrical and Computer Engineering, the Wireless Networking and Communication Group, and the Institute for Neuroscience. His research interests include digital television, digital photography, visual perception, social media, and image and video processing. His work broadly focuses on creating new theories and algorithms that allow for the perceptually optimized streaming and sharing of visual media. The outcomes of his work have the benefits of ensuring the visual satisfaction of billions of viewers worldwide, while substantially reducing global bandwidth consumption. He has published over 1,000 technical articles in these areas. His publications have been cited more than 175,000 times in the literature, his H-index is above 135, and he is listed as a Highly-Cited Researcher by The Web of Science Group. His several books include the Handbook of Image and Video Processing (Academic Press, 2000, 2005), Modern Image Quality Assessment (2006), and the companion volumes The Essential Guides to Image and Video Processing (Academic Press, 2009).

Balu Adsumilli

Dr. Balu Adsumilli

Dr. Balu Adsumilli (IEEE Fellow) is the Head of Media Algorithms group at YouTube/Google, where he and his team research and develop algorithms to transform the uploaded videos to formats played across all your devices. Over the past years, he was instrumental in building and scaling technologies in the areas of video processing, computer vision, video compression, and video quality, which garnered Two Technology and Engineering Emmy awards for Google. Prior to YouTube, he was the Director of Advanced Technology at GoPro, where he led the Camera Architecture, and the Advanced Software teams, and developed their ProTune mode in collaboration with ACES and Technicolor. This paved the way for GoPro cameras capturing Industry neutral formats, and enabled their widespread applicability in the movie and television industry. Dr. Adsumilli serves on the board of the Television Academy, on the Visual Effects Society board, on the NATAS technical committee, on the IEEE Multimedia Signal Processing (MMSP) Technical Committee, the IEEE Image, Video, Multidimensional Signal Processing (IVMSP) Technical Committee, and on ACM Mile High Video Steering Committee. He has co-authored 125+ technical publications and holds 200+ US patents. He is on TPCs and organizing committees for various conferences and organized numerous workshops. He is a Fellow of IEEE, and an active member of ACM, SMPTE, VES, SPIE, and the Internet Society. He received his PhD from the University of California Santa Barbara, and masters from the University of Wisconsin Madison.

Competition

List of Organizers

Chris Zhou
Chris Wei Zhou
Assistant Professor, Cardiff University, UK
zhouw26@cardiff.ac.uk
Jian Wang
Jian Wang
Staff Research Scientist, Snap Research, USA
jwang4@snap.com
Sizhuo Ma
Sizhuo Ma
Senior Research Scientist, Snap Research, USA
sma@snap.com
Xiongkuo Min
Xiongkuo Min
Associate Professor, Shanghai Jiao Tong University, China
minxiongkuo@sjtu.edu.cn
Guangtao Zhai
Guangtao Zhai
Professor, Shanghai Jiao Tong University, China
zhaiguangtao@sjtu.edu.cn
Zhengzhong Tu
Zhengzhong Tu
Assistant Professor, Texas A&M University, USA
tzz@tamu.edu
Hadi Amirpour
Hadi Amirpour
Assistant Professor, University of Klagenfurt, Austria
hadi.amirpour@aau.at
Shiqi Wang
Shiqi Wang
Associate Professor, City University of Hong Kong
shiqwang@cityu.edu.hk
Hanwei Zhu
Hanwei Zhu
Research Scientist, Nanyang Technological University, Singapore
hanwei.zhu@ntu.edu.sg
Yixiao Li
Yixiao Li
PhD student, Cardiff University, UK
liy369@cardiff.ac.uk
Fan Huang
Fan Huang
PhD student, Shanghai Jiao Tong University, China
huangfan@sjtu.edu.cn
Shuo Xing
Shuo Xing
PhD student, Texas A&M University, USA
shuoxing@tamu.edu