Visual cognition is a key capability of human being, but a big challenge in artificial intelligence. We provide thousands of labeled video sequences and clips to help researchers train their models to "understand" what these videos represent.
Scene parsing is a core capability for autonomous driving technologies. We have collected and annotated a large amount of outdoor scenes captured by vehicle mounted sensors. The whole dataset will evolve to include RGB videos with per pixel annotation and high-accuracy depth, stereoscopic video, and panoramic images.
Machine Reading Comprehension (MRC) is one of the core abilities of artificial intelligence. We release DuReader, a large-scale real-world Chinese dataset for MRC to promote the research. DuReader contains more than 200K questions, 1M evidence documents and 420K human generated answers.