Big Database Tutorial

Introduction

  • With an ever increasing growth of databases in activities like social networking, online shopping, e-learning, e-banking etc; managing such a huge amount of data is  very big challenge.
  • The traditional data management systems and other existing tools are facing difficulties to analyze and process such a big data.
  • 'Big Data' means a collection of the structured and unstructured data which is large and difficult to analyze and process with traditional data management system.

Four Vs of Big Data

Big data provides some important parameters for data processing. They are:

1. Volume:
It refers to the generation of large amount of data during data processing by using an application at every moment.
For example: Twitter Messages

2. Velocity:
It refers to the speed at which new data is generated and the speed at which data moves around the globe.
For example: Stock Exchange.

3. Variety:
It refers to the different types of data which are used in processing. The data can be structured or unstructured.
For example: In face-book, user can share data in the form of text, audio, video.

4. Veracity:
It refers to the trustworthiness of the data, Reliability data is important for organizations as well as users.
For example:  Twitter post with abbreviations.