Μελετητής Google

X Min, H Duan, W Sun, Y Zhu, G Zhai - Science China Information …, 2024 - Springer

Perceptual video quality assessment plays a vital role in the field of video processing due to
the existence of quality degradations introduced in various stages of video signal …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 79 Σχετικά άρθρα Όλες οι 2 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

Cambrian-1: A fully open, vision-centric exploration of multimodal llms

S Tong, E Brown, P Wu, S Woo, M Middepogu… - arxiv preprint arxiv …, 2024 - arxiv.org

We introduce Cambrian-1, a family of multimodal LLMs (MLLMs) designed with a vision-
centric approach. While stronger language models can enhance multimodal capabilities, the …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 170 Σχετικά άρθρα Όλες οι 4 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] arxiv.org

Towards open-ended visual quality comparison

H Wu, H Zhu, Z Zhang, E Zhang, C Chen, L Liao… - … on Computer Vision, 2024 - Springer

Comparative settings (eg. pairwise choice, listwise ranking) have been adopted by a wide
range of subjective studies for image quality assessment (IQA), as it inherently standardizes …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 36 Σχετικά άρθρα Όλες οι 2 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

A comprehensive study of multimodal large language models for image quality assessment

T Wu, K Ma, J Liang, Y Yang, L Zhang - European Conference on …, 2024 - Springer

Abstract While Multimodal Large Language Models (MLLMs) have experienced significant
advancement in visual understanding and reasoning, their potential to serve as powerful …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 23 Σχετικά άρθρα Όλες οι 2 εκδοχές

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] LLMs and generative agent-based models for complex systems research

Y Lu, A Aleta, C Du, L Shi, Y Moreno - Physics of Life Reviews, 2024 - Elsevier

Abstract The advent of Large Language Models (LLMs) has significantly transformed
research across natural and social sciences, offering new paradigms for understanding …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 7 Σχετικά άρθρα Όλες οι 5 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

Depicting beyond scores: Advancing image quality assessment through multi-modal language models

Z You, Z Li, J Gu, Z Yin, T Xue, C Dong - European Conference on …, 2024 - Springer

We introduce a Depict ed image Q uality A ssessment method (DepictQA), overcoming the
constraints of traditional score-based methods. DepictQA allows for detailed, language …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 26 Σχετικά άρθρα Όλες οι 2 εκδοχές

[Free GPT-4]

[PDF] arxiv.org

Q-align: Teaching lmms for visual scoring via discrete text-defined levels

H Wu, Z Zhang, W Zhang, C Chen, L Liao, C Li… - arxiv preprint arxiv …, 2023 - arxiv.org

The explosion of visual content available online underscores the requirement for an
accurate machine assessor to robustly evaluate scores across diverse types of visual …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 93 Σχετικά άρθρα Όλες οι 4 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] arxiv.org

Aesexpert: Towards multi-modality foundation model for image aesthetics perception

Y Huang, X Sheng, Z Yang, Q Yuan, Z Duan… - Proceedings of the …, 2024 - dl.acm.org

The highly abstract nature of image aesthetics perception (IAP) poses a significant
challenge for current multimodal large language models (MLLMs). The lack of human …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 16 Σχετικά άρθρα Όλες οι 2 εκδοχές

[Free GPT-4]

[PDF] thecvf.com

Aigc-vqa: A holistic perception metric for aigc video quality assessment

Y Lu, X Li, B Li, Z Yu, F Guan, X Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

With the development of generative models such as the diffusion model and auto-regressive
model AI-generated content (AIGC) is experiencing an explosive growth. Moreover existing …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 6 Σχετικά άρθρα Προβολή ως HTML

[Free GPT-4]

[PDF] arxiv.org

Promptiqa: Boosting the performance and generalization for no-reference image quality assessment via prompts

Z Chen, H Qin, J Wang, C Yuan, B Li, W Hu… - European Conference on …, 2024 - Springer

Due to the diversity of assessment requirements in various application scenarios for the IQA
task, existing IQA methods struggle to directly adapt to these varied requirements after …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 6 Σχετικά άρθρα Όλες οι 2 εκδοχές

Δημιουργία ειδοποίησης

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

Q-instruct: Improving low-level visual abilities for multi-modality foundation models

Perceptual video quality assessment: A survey

Cambrian-1: A fully open, vision-centric exploration of multimodal llms

Towards open-ended visual quality comparison

A comprehensive study of multimodal large language models for image quality assessment

[HTML][HTML] LLMs and generative agent-based models for complex systems research

Depicting beyond scores: Advancing image quality assessment through multi-modal language models

Q-align: Teaching lmms for visual scoring via discrete text-defined levels

Aesexpert: Towards multi-modality foundation model for image aesthetics perception

Aigc-vqa: A holistic perception metric for aigc video quality assessment

Promptiqa: Boosting the performance and generalization for no-reference image quality assessment via prompts