Big Data:
Big data is the data
which it’s growing from scarce to superabundant day by day. We all are making
big buzz about big data, it’s good and big-headache to analyze. Now this is a
very hot topic. It’s defined as Volume, Velocity, and Variety by IBM.As you
know we all are using Facebook account to upload photos, videos and messages on
daily basis.
•
7TB
of data are processed by Twitter every day.
•
10TB
of data are processed by Facebook every day.
•
2
billion internet users in the world today.
• 4.6 billion mobile phones in 2011.
Rollin Ford, the CIO of Wal-Mart said “Every day I wake up and ask,
‘how can I flow data better, manage data better, analyse data better?”.
Big Data History:
As we know, we are always gathering information about documents, photos
and videos in storage devices by using latest technologies and devices (with
large pixels). In Olden days, people collected the papers and stored in the
warehouse or library in form of records, books etc.
The fact of the
matter is big data is not new. In 1940s, Libraries effected and had to
adapt their storage methods to meet the quickly increasing papers of new
publications and re-search and encounter the first attempts to qualify the
growth rate in the volume of data as information explosion in 1941.
In 1997s, the term “Big Data” was used for first time in an article by
NASA researchers M.Cox and D. Ellsworth to identify the rise of data was
becoming an issue for current computer systems.
In 1999s, Doug Cutting wrote the Lucene is the key component of search
index in Apache Nutch, Apache Solr and Elastic Search. Nutch faced some troubles
to manage computations running on parallel computers while building an open
source engine. In 2003 and 2004, Google published its GFS and Map reduces
papers, the route became clear to implement Hadoop by Doug Cutting in 2006.
Today, companies in every industry is facing big data problems which
generated by their daily-increased ability to collect information.
Big Data Key Challenges:
1.
Performance
2.
Network Traffic
3.
Risk Failure
4.
Analysing the data
5.
Addressing data quality
6.
Displaying meaningful results
7.
Dealing with reports by using graphical presentation
Big Data Technologies:
- Data Scientist
- In-Memory databases
- Distributed Databases
- NO-SQL
- Data warehouse appliances
- Hadoop
No comments:
Post a Comment