March 15, 2015 at 18:58
Overview of the most interesting materials on data analysis and machine learning No. 39 (March 9 - 15, 2015)

Here is the next issue of the review of the most interesting materials on data analysis and machine learning.
General
  Risk Protection Machine Learning System Architecture
  SQL-like queries for real-time streaming analytics
  Announcement of Apache Spark 1.3 - a brief overview of the features of the new version of Apache Spark.
  New version of R released: 3.1.3
  Apache Spark: Star Rise
Theory and algorithms of machine learning, code examples
  Machine Learning - 2. Non-linear regression and numerical optimization - accumulated statistics on audience views and target actions served as the starting point for this article. In it, the author briefly considers an example of nonlinear (namely, exponential) regression and uses it to construct a conversion model that distinguishes two groups among users.
  Working with meta-network structures in Python - MetaNet library - in this article, the author discusses some of the prerequisites that led to the emergence of a tool for modeling meta-networks.
  Visual linear approximation using Gnuplot
  Deep Learning Equipment Selection Guide
  Deep Learning, the Curse of Dimensionality and Autoencoders
  Using Deep Learning to Understand Text Information
  Python: scikit-learn - training a classifier with non-numeric features
  How machine learning algorithms work (part 1). Artificial neurons and single-layer neural networks
  Implementing a naive Bayesian classifier on Apache Flink
  Machine Learning for Beginners (Part 1)
  Genetic Algorithm Description
  Python data processing and machine learning. Presentation and code examples
  Python k-means clustering
  Introduction to Microsoft Azure Machine Learning Studio
  Improving Apache Spark Performance (Part 1)
  Gravitational Clustering: A new supervised learning algorithm. Description and implementation
Machine Learning Competitions
Online courses, training materials and literature
  Peter Flach's book on machine learning translated into Russian
  Online course at Coursera: Process Mining: Data science in Action
  Online Course: Text Retrieval and Search Engines
  Online Course at Coursera: Applied Regression Analysis
  Johns Hopkins University Online Course: Mathematical Biostatistics Boot Camp 1
  Free eBook Review: Data Driven: Creating a Data Culture
Videos, podcasts
  Introduction to Deep Learning. Set of video lectures
  Top 10 Data Mistakes
  Talking Machines: Episode 6: Interviews with Geoffrey Hinton, Yoshua Bengio and Yann LeCun: The Future of Machine Learning from the Inside - the sixth episode of the Talking Machines podcast series, in this case a continuation of a conversation with Geoffrey Hinton (Google, University of Toronto), Yoshua Bengio (University of Montreal) and Yann LeCun (Facebook, NYU).
Data engineering
  Airpal: a web-based SQL application - Airpal is a web-based database application designed to complement Facebook's PrestoDB when analyzing information. This post covers its capabilities and features.
  Creating a Single View in MongoDB (Part 1): Overview and Data Analysis
  Big Data Processing in Apache Spark
  Apache Spark with Neo4j using Docker Compose
Reviews
  Interesting from the world of R (March 9-15, 2015)
  Best Content of the Week from KDnuggets.com (March 1 - 7)
  Best Content of the Week from KDnuggets.com (March 8-14)
  DataScienceCentral Weekly Digest (March 16th)
  Data Science News from MyDataMine.com (March 15)
  Big Data News from MyDataMine.com (March 12)
  Best Resources of the Week from Data Elixir (No. 26)
  The weekly collection of the best materials from R1Soft (March 13)
  The most interesting materials on High Scalability (March 13)
Previous issue: Overview of the most interesting materials on data analysis and machine learning No. 38 (March 2 - 8, 2015)