Bitcoin Forum
Topic: Gathering and curation of datasets
vn3t (OP)
December 28, 2017, 04:00:54 PM
 #1

Gathering structured data and controlling its quality are the main hassles for machine learning engineers. A given gathering method generally produces only a single set of structured data (this can be mitigated by smart gathering methods, classification and some software features).

Examples of dataset gathering methods:
●   Bots: automated scripts that crawl the web or databases to “mine” data;
●   Third parties: external third parties that provide inputs, which are aggregated to form a new set of structured data;
●   Bulk buys: data bought from external data processors, or open-source datasets.
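
To make the aggregation and quality control step concrete, here is a minimal sketch of merging records from several gathering methods into one structured dataset. Everything in it is an assumption made for illustration: the schema (`id`, `text`, `label`), the source names, and the helper names `is_valid` and `aggregate` are hypothetical and not part of any particular tool.

```python
from collections import Counter

# Hypothetical schema: every record must carry these fields.
REQUIRED_FIELDS = ("id", "text", "label")


def is_valid(record):
    """Basic quality check: all required fields are present and non-empty."""
    return all(record.get(field) not in (None, "") for field in REQUIRED_FIELDS)


def aggregate(sources):
    """Merge records from several named sources into one deduplicated list.

    `sources` maps a source name to an iterable of dict records.
    Returns (dataset, rejected), where `rejected` counts invalid
    records per source.
    """
    seen_ids = set()
    dataset = []
    rejected = Counter()
    for name, records in sources.items():
        for record in records:
            if not is_valid(record):
                rejected[name] += 1
                continue
            if record["id"] in seen_ids:
                continue  # duplicate across sources: keep the first occurrence
            seen_ids.add(record["id"])
            # Tag each record with where it came from.
            dataset.append({**record, "source": name})
    return dataset, rejected


# Made-up example inputs, one list per gathering method.
sources = {
    "bot": [
        {"id": 1, "text": "a", "label": "x"},
        {"id": 2, "text": "", "label": "y"},  # fails the quality check
    ],
    "third_party": [
        {"id": 1, "text": "a", "label": "x"},  # duplicate of the bot record
        {"id": 3, "text": "c", "label": "z"},
    ],
}

dataset, rejected = aggregate(sources)
print([r["id"] for r in dataset])
print(dict(rejected))
```

Keeping a per-source count of rejected records is one simple way to see which gathering method produces the lowest quality input.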
HappyMod
March 29, 2018, 11:16:08 AM
 #2

Agreed. Lots of machine learning projects require big datasets, but those generally aren't available, either in sufficient quantity or in sufficient quality.
