* UC 얼바인 머신러닝 저장소 : 데이터셋
http://archive.ics.uci.edu/ml/index.php
UCI Machine Learning Repository
Welcome to the UC Irvine Machine Learning Repository! We currently maintain 471 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. For a general overview of the Repository, please visit ou
archive.ics.uci.edu
* 캐글 데이터 셋 : 데이터 셋 및 각종 분석 모델 공유
https://www.kaggle.com/datasets
Datasets | Kaggle
www.kaggle.com
* 아마존 데이터 셋 : 별로 연습용으로 활용하기 좋지 않음/ AWS 솔루션 사용자를 위한 데이터 셋
https://registry.opendata.aws/
Registry of Open Data on AWS
eventsdisaster response This project monitors the world's broadcast, print, and web news from nearly every corner of every country in over 100 languages and identifies the people, locations, organizations, counts, themes, sources, emotions, quotes, images
registry.opendata.aws
* 각 국가의 공공 데이터 링크 모은 사이트 : 각 국가마다 사이트 가입 필요(한국은 현재 16개)
DataPortals.org - A Comprehensive List of Open Data Portals from Around the World
This service is run by Open Knowledge International | Source Code | Download Data (CSV) | Download Data (JSON) | Data License (Public Domain) | Privacy Policy
dataportals.org
* 유럽 국가의 공개 데이터 링크 모음: 링크를 많이 타고 가야해서 불편
https://opendatamonitor.eu/frontend/web/index.php?r=dashboard%2Findex
OpenDataMonitor
This measure is an average of the missing metadata across a defined set of fields: licence, author, organisation, date released and date updated.
opendatamonitor.eu
* 유료 데이터 셋(일부 무료) : 고유 데이터 판매 가능
Quandl
The source for financial, economic, and alternative datasets, serving investment professionals.
www.quandl.com
* 위키백과: 머신러닝 주요 데이터 셋 목록
https://en.wikipedia.org/wiki/List_of_datasets_for_machine-learning_research
List of datasets for machine-learning research - Wikipedia
These datasets are used for machine-learning research and have been cited in peer-reviewed academic journals. Datasets are an integral part of the field of machine learning. Major advances in this field can result from advances in learning algorithms (such
en.wikipedia.org
* 데이터셋 리스트 모음 링크
https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
Where can I find large datasets open to the public?
Answer (1 of 215): Large data sets mostly from finance and economics that could also be applicable in related fields studying the human condition: World Bank Data. Lots of years. Lots of Countries Countries | Data. Lots of of data variables (Topics | Data
www.quora.com
* 데이터 셋 서브레딧
https://www.reddit.com/r/datasets
Datasets • r/datasets
A place to share, find, and discuss Datasets.
www.reddit.com
* 카네기 멜론 대학교 통계학과 데이터 셋
http://lib.stat.cmu.edu/datasets/
StatLib---Datasets Archive
StatLib---Datasets Archive If you have an interesting dataset, or collection of data from a book, please consider submitting the data. To submit a dataset, please see the submissions guidelines, via send submissions from general Some of the entries are sha
lib.stat.cmu.edu
* 깃허브 유명 데이터 셋
https://github.com/awesomedata/awesome-public-datasets
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets. PR ☛☛☛. Contribute to awesomedata/awesome-public-datasets development by creating an account on GitHub.
github.com
'데이터분석 > 머신러닝' 카테고리의 다른 글
[sklearn] train_test_split 사용하는 방법 및 유의사항 (0) | 2021.05.09 |
---|---|
[회귀분석] 회귀분석 모델 한 번에 돌려서 가장 좋은 성능 모델 값 뽑기 (0) | 2020.02.24 |
[모델 선택하기] 머신러닝(지도학습,비지도학습,강화학습)/딥러닝 (0) | 2018.07.19 |
[기초개념] 데이터 분석 관점에서 한줄로 정리한 '머신러닝 딥러닝 데이터 분석을 하기 위해 꼭 알아야할 기본 개념' (0) | 2018.05.28 |
[데이터 분석] 의미있는 피처(컬럼) 선택 (0) | 2018.05.17 |