Dear Technocrats,
In this post we begin a series of interview preparation material for those looking to enter the field of big data analytics. First, let's go over some common interview tips:
- Listen attentively before answering any question.
- Be very specific and precise in your answers.
- In today's fast-changing IT industry, recruiters focus more on well-educated personnel than on well-trained ones; understand the difference.
- Focus more on the outcome of learning than on the syntax of learning. Having some idea of the business use of your technology domain will be a plus.
- Show flexibility rather than rigidity about any technology or platform, especially if you are a fresher.
- Asking the interviewer one or two questions about the company is considered good practice, but avoid getting into prolonged arguments.
- A common question to ask the interviewer is: "When is a person considered successful in this position?"
Below is a set of questions you can expect to be asked in your interview.
Basic Questions:
- What do you understand by big data, and what techniques are used to handle it?
- What is the difference between structured and unstructured data? Support your answer with examples.
- What do you know about NoSQL databases? How are they different from RDBMS?
- What is MapReduce? Explain its phases in detail.
- What is a distributed file system? How is it different from a conventional file system? Explain both with examples.
- What are the limitations/shortcomings of the MapReduce framework?
- Do you know about IBM Watson? How is it helpful in big data analytics?
- Is there any relation between big data analytics and cloud computing?
- Define horizontal scalability. What are its benefits in the Hadoop framework?
- Explain the role and working of the NameNode, DataNode, JobTracker, and TaskTracker.
- What is the difference between Hadoop 1.x and Hadoop 2.x?
- Explain sharding and its importance.
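When preparing for the MapReduce question above, it can help to walk through the phases on a tiny example. The following is only a minimal single-machine sketch in plain Python, not Hadoop code: the function names (map_phase, shuffle_phase, reduce_phase, word_count) and the sample sentences are made up for illustration, and a real cluster would run the map and reduce tasks in parallel across many nodes.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(mapped_pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in mapped_pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(grouped):
    """Reduce: combine the list of values for each key into one result."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(documents):
    """Run map, shuffle, and reduce in sequence (hypothetical helper)."""
    mapped = [pair for doc in documents for pair in map_phase(doc)]
    return reduce_phase(shuffle_phase(mapped))


if __name__ == "__main__":
    # Sample input invented for this illustration.
    docs = ["big data needs big storage", "data beats opinion"]
    print(word_count(docs))
```

In an interview answer, you could point out that the shuffle step is where data moves between machines in a real cluster, which is one reason it is often the most expensive phase.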
Advanced-level questions:
- How can Kafka be integrated with Hadoop/Spark for stream processing?
- What is the use of NiFi in big data processing frameworks?
- Which NoSQL database is suited to storing and processing binary data (images)?
- What is the difference between RDDs and DataFrames in Spark?
For more such questions, discussions on polls, and technical articles on the latest technologies for big data analytics, check out the posts on the DataioticsHub page.
If you are new to big data analytics, please start with the basics in this post. To understand and learn the complete technology stack for big data engineering, visit DataioticsHub.