Google's filing details a system that uses machine learning to detect the presence of specific products in image and video data.
The system relies on a fusion model - a deep learning model that can consider multiple types of data - which calculates confidence values determining the likelihood that the text or images contain information related to a specific product.
[
add
]
[
|
|
...
]