Data is better than oil, or the sixth set for a big data program

    Habr, hello! It’s hard to believe, but on March 16th we will launch the 6th set of our program “Big Data Specialist”.

    image

    Currently, we already have about 160 graduates who, with varying degrees of involvement, apply the knowledge and skills acquired in the program. Probably, one may wonder whether so many frames are needed. There are two answers to this reasonable doubt. First, we keep abreast and periodically conduct market analysis . Secondly, the market is not a static entity and is growing, and the number of open vacancies is not a sufficient metric to measure this demand.

    Two years ago, when we launched the program for the first time, there was no need for so many specialists. But, as it turned out later, most of our students do not change their place of work, but use the acquired skills and knowledge in their companies. And for the most part, these vacancies are not published in the public domain, because there are not even such positions yet.

    Big data technology companies are becoming more and more. That is why we launched a separate program for managers who are now eyeing this topic and are going to conduct the competent implementation of these technologies in the future.

    If we recall the curve of the spread of innovation and technology on the market, it becomes clear that this growth can continue further.

    image


    At the moment, the tools for working with big data are most likely implemented by the first two classes: innovators and early followers, and representatives of the early majority are just starting to connect.

    As a summary: there is no reason to believe that the need for big data specialists will decrease. Rather, on the contrary, it will grow. The phrase now popular that the data is new oil is not entirely true. Data is better than oil. Everyone can have access to data if they learn to collect and process them correctly, which cannot be said about oil. So come to us to study.

    Despite the fact that this is already the 6th launch, we continue to work on the product and decided to add something new to it.

    1) Previously, we had 12 labs: one for each week of the program. This time, we decided to make 10 labs and 2 “graduation” projects, for which students will prepare throughout the module. One project will be called “Predicting the sex and age of a user by web logs,” and the second “Building a recommendation system for online retail.” You will have enough time to work on them and make them something that you can be proud of and that you can add to your portfolio of work.

    2) From time to time we were asked to do group tasks in the program. This time, part of the laboratory work involves group work. So there will be a reason to make friends with each other even more.

    3) At the moment, we are optimizing the work of our automatic checkers for laboratory work, so that the check takes up minimal time and does not infuriate. Plus, add a progress indicator on the program. At any time, it will be possible to see if there are enough points for this or that certificate or not.

    4) Traditionally, our cluster will have all the most modern and current libraries and, of course, the latest version of Apache Spark. About 80-90 percent of the work in this tool will take place on data frames. During the last launch, we migrated our training materials from RDD to this data structure.

    We are waiting for you at the program . It will not be boring!

    Also popular now: