
Overview of the most interesting materials on data analysis and machine learning No. 27 (December 15 - 21, 2014)

I present to you the next issue of a review of the most interesting materials on the topic of data analysis and machine learning.
General
IBM has made Watson Analytics available to everyone
Artificial intelligence is not a threat to us
Start using machine learning today - a good post from the author of the MachineLearningMastery blog, which will help beginners quickly learn basic things from the field of machine learning and start using machine learning algorithms in practice.
Baidu announces a breakthrough in speech recognition and claims to outperform Google and Apple
Top 10 Big Data Startups in 2014
5 Deep Learning Startups to Watch Out for in 2015
IBM Watson Analytics vs. Microsoft Azure Machine Learning (Part 1) - Comparison of two analytical systems from the authors of the blog KDnuggets.com.
Data Mining (and Statistical Analysis) the most requested skills according to LinkedIn for 2014
The most sought-after skills in Data Science and Data Mining are an interesting study from the authors of the KDnuggets.com blog.
List of open source machine learning tools from KDnuggets.com
The best data visualization projects in 2014
22 key big data terms everyone needs to understand
IBM Big Data & Analytics Hub Infographic: Four Vs in Big Data
Big Data Forecasts for 2015 from Big Data Analytics News
2014 R Resources List from Revolution Analytics
What every machine learning library can borrow from Vowpal Wabbit
Key Machine Learning Trends in 2014 Based on the Results of the Neural Information Processing Systems (NIPS) 2014 Conference
List of useful resources on R from DZone.com
2015 Big Data Forecasts
6 Big Data forecasts for 2015 from Information Week
2015 Data Science Highlights from Analytics Vidhya
The announcement of the new version of BabelNet is an article about the release of version 3.0 of the popular multilingual dictionary and the semantic network BabelNet, in which Russian is also present.
Announcement of Apache Spark 1.2
Htmlwidgets for R: a library for visualizing data in R using JavaScript
Scientific Approach to Solving Data Analysis Problems
Theory and algorithms of machine learning, code examples
Hacker Guide to Neural Networks. Schemes of real values. Templates in the "reverse" stream. One Neuron Example
Oil Rows in R
Security Scanners: Automatically Validate Vulnerabilities Using Fuzzy Sets and Neural Networks
Python linear regression implementation
Sentiment Analysis using kimono and MonkeyLearn
Optimizing memory usage in R is a useful article from the popular Yhat blog on optimizing memory usage in the R programming language.
Hierarchical clustering using R (using D3.js and Shiny)
Ask a Data Scientist: Ensemble Methods - another article from the popular insideBIGDATA portal from the Ask a Data Scientist series, in this issue we will talk about such a concept as Ensemble Methods.
Machine Learning Competitions
Online courses, training materials and literature
Stepic Online Course: Fundamentals of Statistics - The course introduces students to the basic concepts and methods of mathematical statistics.
Udacity Data Analyst Nanodegree - A brief overview of Udacity Data Analyst Nanodegree.
A course on data visualization using D3.js - not so long ago a new, quite curious course appeared on the Udacity online learning site, created jointly with Zipfian Academy and dedicated to the topic of data visualization and the use of the popular visualization library D3.js.
List of books on hands-on machine learning is a good list of books on hands-on machine learning from the author of the MachineLearningMastery blog.
The 14 Best Big Data Books in 2014
Overview of Introduction to Data Science with R
Overview of Data Science at the Command Line
Free e-book Big Data Basics
Free e-book Big Data Analytics for Dummies
Free e-book Practical Machine Learning: Innovations in Recommendation
Videos
Badoo talk video from Highload 2014
Data modeling in NoSQL - in this video, Jan Steemann (Senior Developer, triAGENS) will talk about how to correctly model data in NoSQL repositories and provide some good practical examples.
Apache Cassandra for beginners - in this post are two video lectures that will help you understand the basic concepts of Apache Cassandra.
IBM Watson in action
Data engineering
30-Year NBA Data Processing with MongoDB Aggregation
Краткое введение в экосистему Hadoop
10 прогнозов по экосистеме Hadoop на 2015 год
Почему 2015 будет годом NoSQL
SparkOnHBase от Cloudera — статья про интересный проект от компании Cloudera под названием SparkOnHBase с примерами использования.
16 NoSQL хранилищ, за которыми стоит следить — полезный список из 16 NoSQL хранилищ с небольшим описанием каждого с блога KDnuggets.com.
Введение в NoSQL — неплохой краткий рассказ про NoSQL хранилища от автора блога Analytics Vidhya.
Прогнозы на 2015 год в области хранилищ данных от DataVersity
10 лучших постов с блога Cloudera в 2014 году
Обзоры
Интересное из мира R (15-21 декабря 2014 г.)
Еженедельный дайджест от DataScienceCentral (22 декабря)
Лучшие материалы за неделю от KDnuggets.com (7 — 14 декабря)
The weekly collection of the best materials from R1Soft (December 19)
Best Resources of the Week from Data Elixir (No.14)
The most interesting materials from Freakonometrics No. 193
The most interesting materials from Freakonometrics No. 194
The most interesting materials on High Scalability (December 19)
Previous issue: Overview of the most interesting materials on data analysis and machine learning No. 26 (December 8-14, 2014)