Performing image search based on user input using neural networks.
U.S. Patent No. 11,914,635. Washington, DC: U.S. Patent and Trademark Office., 2024
Recommended citation: Victor S. Bursztyn, Jennifer Anne Healey, Vishwa Vinay, and Tong Sun. 2024. Performing image search based on user input using neural networks. U.S. Patent No. 11,914,635. Washington, DC: U.S. Patent and Trademark Office. https://patents.google.com/patent/US11914635B2/en
Systems and methods for image searching are described. The systems and methods include receiving a search query comprising user input for a reference image; converting the user input for the reference image to a preference statement using a machine learning model; encoding the preference statement in an embedding space to obtain an encoded preference statement; combining the encoded preference statement with an encoded reference image representing the reference image in the embedding space to obtain a multi-modal search encoding; and performing a search operation using the multi-modal search encoding to retrieve a second image, wherein the second image differs from the reference image based on the user input for the reference image.
