Attributions for open source datasets made available in Scale Rapid are available below. This document contains licensing information relating to the use of free and open-source software (FOSS) with or within the Scale Rapid software. Any terms, conditions, or restrictions on FOSS included within the Scale Rapid software that are not included within the original FOSS licenses are offered and imposed by Scale alone. The authors, licensors, and distributors of the FOSS disclaim all express or implied conditions, representations, and warranties relating to the FOSS and any liability arising from use and distribution of the FOSS. This document identifies the FOSS packages made available in the Scale Rapid software, the FOSS licenses that Scale believes govern those FOSS packages, and copyright and license notices associated with Scale’s use of the FOSS. While Scale has sought to provide complete and accurate licensing information for each FOSS package, Scale does not represent or warrant that the licensing information provided herein is correct or error-free. Recipients of the product should investigate the identified FOSS packages to confirm the accuracy of the licensing information provided herein. Recipients are also encouraged to notify Scale of any inaccurate information or errors found in these notices. Certain FOSS licenses, such as the Mozilla Public License, require Scale to make available to recipients the source code corresponding to FOSS binaries distributed under those licenses. Recipients who would like to receive a copy of such source code should submit a request to Scale by post at: Scale AI, Inc. Attn: FOSS Requests 303 2nd St, Fl 5, San Francisco, CA 94107. Please identify in submitted FOSS requests: the FOSS packages for which you are requesting source code; the Scale product and version number with which the requested FOSS package was distributed; an email address at which Scale may contact you regarding the request (if available); and the postal address for delivery of the requested source code. 

 

MNIST (http://yann.lecun.com/exdb/mnist/)

 

COCO 2020 (https://cocodataset.org/#home)

Copyright COCO Consortium

The annotations in this dataset are licensed under a Creative Commons Attribution 4.0 License. The COCO Consortium does not own the copyright of the images. Use of the images must abide by the Flickr Terms of Use. The users of the images accept full responsibility for the use of the dataset, including but not limited to use of any copies of copyrighted images that they may create from the dataset.

 

CIFAR-100 (https://www.cs.toronto.edu/~kriz/cifar.html)

 

Debagreement: Reddit 50K (https://scale.com/open-av-datasets/oxford)

This dataset is distributed by John Pougué-Biyong, Valentina Semenova, Alexandre Matton, Rachel Han, Aerin Kim, Renaud Lambiotte, and Doyne Farmer under a Creative Commons Attribution 4.0 International Public License (“CC BY 4.0”). 

 

Wikipedia Links Data (https://code.google.com/archive/p/wiki-links/downloads)

This dataset is distributed by Sameer Singh, Amarnag Subramanya, Fernando Pereira, and Andrew McCallum under a Creative Commons BY license

 

Speech Commands Dataset (https://ai.googleblog.com/2017/08/launching-speech-commands-dataset.html)

This dataset is released under a Creative Commons BY 4.0 license. 

 

FSDnoisy18k (https://zenodo.org/record/2529934#.Yz4TbezML6v)

 

This dataset is licensed under a Creative Commons BY 4.0 license. Iindividual audio clips are licensed under a Creative Commons BY 4.0 or CC0 1.0 Universal (CC0 1.0) license. 

More information about the licensing can be found at https://zenodo.org/record/2529934#.Yz4TbezML6v