Visual cognition is a key capability of human beings, but it remains a major challenge in artificial intelligence. We provide thousands of labeled video sequences and clips to help researchers train their models to "understand" what these videos represent.
Scene parsing is a core capability for autonomous driving technologies. We have collected and annotated a large number of outdoor scenes captured by vehicle-mounted sensors. The dataset will evolve to include RGB videos with per-pixel annotation and high-accuracy depth, stereoscopic video, and panoramic images.
Machine Reading Comprehension (MRC) is one of the core abilities of artificial intelligence. We release DuReader, a large-scale, real-world Chinese dataset for MRC, to promote research in this area. DuReader contains more than 200K questions, 1M evidence documents, and 420K human-generated answers.
Open-Domain Information Extraction (OIE) is the task of extracting important information from open-domain sentences. OIE has proven valuable in many artificial intelligence tasks, such as text summarization, text comprehension, and knowledge-based question answering. We release the SAOKE dataset, a human-annotated dataset containing more than 40 thousand Chinese sentences and their corresponding facts in SAOKE form.