List Headline Image
Updated by Steven Berke on May 26, 2017
Steven Berke Steven Berke
9 items   1 followers   0 votes   8 views

Big data and machine learning

Data Lake and the rise of the microservices

By simply looking at structured and unstructured data, Data Lakes enable companies to understand correlations between existing and new external data - such as …

bliki: DataLake

Data Lake is a term that's appeared in this decade to describe
an important component of the data analytics pipeline in the world of
Big Data. The idea is to
have a single store for all of the raw data that anyone in an
organization might need to analyze. Commonly people use
Hadoop to work on the data in the lake, but the concept is broader
than just Hadoop.

Using microservices to evolve beyond the data lake

The better prepared you are to utilize all the data in your data lake, the more likely you are to be successful​.

Apache Mesos

Apache Mesos abstracts CPU, memory, storage, and other
compute resources away from machines (physical or virtual), enabling
fault-tolerant and elastic distributed systems to easily be built
and run effectively.




Cross Industry Standard Process for Data Mining, commonly known by its acronym CRISP-DM,[1] is a data mining process model that describes commonly used approaches that data mining experts use to tackle problems. Polls conducted at one and the same website (KDNuggets) in 2002, 2004, 2007 and 2014 show that it was the leading methodology used by industry data miners who decided to respond to the survey.[2][3][4][5] The only other data mining standard named in these polls was SEMMA. However, 3-4 times as many people reported using CRISP-DM. A review and critique of data mining process models in 2009 called the CRISP-DM the "de facto standard for developing data mining and knowledge discovery projects."[6] Other reviews of CRISP-DM and data mining process models include Kurgan and Musilek's 2006 review,[7] and Azevedo and Santos' 2008 comparison of CRISP-DM and SEMMA.[8] Efforts to update the methodology started in 2006, but have As of 30 June 2015[update] not led to a new version, and the "Special Interest Group" (SIG) responsible along with the website has long disappeared (see History of CRISP-DM).


lime - Lime: Explaining the predictions of any machine learning classifier

uncanny valley

Today we are happy to share our most recent development: a brand new Beep Boop Slack bot 🎉. It allows you to manage projects, subscribe to notifications from any channel, and chat with us from the…

Creating a smart ChatBot for Slack

Oleg Korol's personal blog. Thoughts on Web Development and Tech in general. JavaScript Full-Stack.