Computer Vision Datasets
Computer vision is a field of artificial intelligence that trains computers to interpret and understand the visual world. Using digital images from cameras and videos and deep learning models, machines can accurately identify and classify objects — and then react to what they “see.”
Displaying Page 5 of 15 (146 total Repositories)
871.6 mb
9.8K2
346.8 mb
8.2K
6.3 mb
10121
3.9 mb
10012
A benchmark collection for sentence-based image description and search, consisting of 8,000 images that are each paired with five different captions which provide clear descriptions of the salient entities and events. … The images were chosen from six different Flickr groups, and tend not to contain any well-known people or locations, but were manually selected to depict a variety of scenes and situations.
1.1 gb
98.1K3
56.6 mb
12833
64.4 mb
2144K
42.4 mb
12260
7.2 gb
713K