Sunday, April 7, 2024

GPT-4V(ision) system card

GPT-4 with vision (GPT-4V) enables users to instruct GPT-4 to analyze image inputs provided by the user, and is the latest capability we are making broadly available. Incorporating additional modalities (such as image inputs) into large language models (LLMs) is viewed by some as a key frontier in artificial intelligence research and development. Multimodal LLMs offer the possibility of expanding the impact of language-only systems with novel interfaces and capabilities, enabling them to solve new tasks and provide novel experiences for their users. In this system card, we analyze the safety properties of GPT-4V. Our work on safety for GPT-4V builds on the work done for GPT-4, and here we dive deeper into the evaluations, preparation, and mitigation work done specifically for image inputs.


