# How to get started with big data?

In this article, we are going to learn about **getting started with big data and the requisites for a big data developer/engineer**.

Submitted by Uma Dasgupta, on August 12, 2018

In my previous article, I discussed what big data is, how big data differs from data mining, and the future scope of big data. In this article, we will discuss **how to get started with big data**.

The field of big data is vast and still relatively new, so getting started can be quite challenging for anyone who wants to learn it. In this article, I will try to show you a guided path for starting your journey with big data and landing a good big data job. I have tried to describe the path step by step, and I hope it helps you get on the right track.

### Steps

The first and most important step is to determine the right role according to our interests and existing skill set. Without deciding on the goal (the position or role) we want to prepare for, learning will be like **"shooting in the dark"**.

Broadly speaking, roles in the big data industry can be classified into two major categories:

- Big Data Engineering
- Big Data Analytics

**Let us discuss the roles and requisites of a big data analyst and a big data engineer/developer.**

### Role of Big Data Engineer or developer

- As the word "engineer" suggests, you should know how to engineer big data: you should be able to find a different perspective or outcome in the same data that others see only as an ordinary bulk or mess of data.
- You have to make sense of the bulk of data and understand how to work with SQL databases effectively.
- You should be able to handle large volumes of data using technologies such as Hadoop and Spark, together with suitable programming languages.

### Requisites for BIG DATA developer/engineer

- A good grip on SQL, core Java, JS, and OOAD (object-oriented analysis and design).
- Preliminary knowledge of R and Python.
- Mastery of tools such as Hadoop, MapReduce, and Spark.
- A good analytical, problem-solving approach and solid code-writing skills.
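To give a feel for the MapReduce idea mentioned in the list above, here is a minimal sketch in plain Python. It does not use Hadoop or Spark; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample lines are my own illustrative assumptions, and a real framework would spread these same phases across many machines.

```python
from collections import defaultdict


def map_phase(lines):
    """Emit a (word, 1) pair for every word in every line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Group values by key, as a framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(lines):
    """Run the three phases in order on an in-memory list of lines."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


# Hypothetical sample input, small enough to check by hand.
sample_lines = [
    "big data is big",
    "data engineering handles big data",
]
counts = word_count(sample_lines)
print(counts)
```

The point of splitting the work this way is that the map and reduce steps are independent per line and per key, which is what lets tools like Hadoop and Spark run them in parallel on very large inputs.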

Figure: Skillset for data scientist

Image source: https://www.experfy.com/blog/a-big-data-analyst-or-a-big-data-developer-what-do-you-want-to-becom

### Role of Big Data Analyst

A huge collection of data (big data) needs proper analysis to give the expected and desired results, and proper analysis in turn needs appropriate database management systems. The main purpose of big data analysis is to improve or expand a company's business.

A company's data, such as transactions, searches, user profile information, and user interests, should be tracked and analyzed in order to extract useful outcomes.
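As a small illustration of this kind of analysis, the sketch below summarizes a handful of transaction records using only the Python standard library. The records, field names, and the `summarize` function are hypothetical and chosen purely for illustration; real big data work would run similar aggregations over far larger datasets with tools such as Hadoop or Spark.

```python
from collections import Counter
from statistics import mean, median

# Hypothetical transaction records; the fields are illustrative only.
transactions = [
    {"user": "u1", "category": "books", "amount": 20.0},
    {"user": "u2", "category": "music", "amount": 10.0},
    {"user": "u1", "category": "books", "amount": 30.0},
    {"user": "u3", "category": "games", "amount": 60.0},
    {"user": "u2", "category": "books", "amount": 40.0},
]


def summarize(records):
    """Compute basic statistics and the highest-revenue category."""
    amounts = [r["amount"] for r in records]

    # Total revenue per category.
    per_category = Counter()
    for r in records:
        per_category[r["category"]] += r["amount"]

    top_category, top_revenue = per_category.most_common(1)[0]
    return {
        "mean_amount": mean(amounts),
        "median_amount": median(amounts),
        "top_category": top_category,
        "top_revenue": top_revenue,
    }


summary = summarize(transactions)
print(summary)
```

Even this tiny example combines the two ingredients an analyst needs: basic statistics (mean and median) and a grouping step that turns raw records into a business-level answer.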

### Requisites for BIG DATA ANALYST

- Good knowledge of maths and statistics.
- Good knowledge of the tools and techniques used for storing, processing, and analyzing big data, such as Hadoop and Spark.
- A focused mind and very clear concepts of maths, statistics, and the different big data technologies are needed to become a good big data analyst.

Figure: Big data analytics

Image source: https://www.experfy.com/blog/a-big-data-analyst-or-a-big-data-developer-what-do-you-want-to-become

Now that we understand which role suits our skills and interests and have decided on it, we can begin the learning process. The big data domain is full of different technologies, so it is quite difficult to master them all. To help, I am attaching a big data engineer/analyst learning path flow chart below.

Figure: flow chart for big data engineer path

Image source: https://www.analyticsvidhya.com/blog/2017/03/big-data-learning-path-for-all-engineers-and-data-scientists-out-there/

**Conclusion:**

I hope this article has helped clear up some of the confusion about how to start learning big data. Once you have decided on your role, you can follow the flow chart and also refer to some of the top certifications that I mentioned in my previous article, Big Data: an emerging trend on IT sector.

So put on your wings and get started. I will see you in the next article. Until then, stay connected, be healthy, and keep learning!
