Mario Bojilov takes on Big Data, and why not? In 2004, Mario Bojilov founded Meta Business Systems, focusing on performance monitoring, business intelligence and social media analytics. Griffith University, Brisbane City Council, Queensland Urban Utilities and QUT currently use solutions designed and implemented by Mario and Meta Business Systems, and Mario is now in demand as a speaker for many professional and academic bodies.
Last week in Brisbane, he gave a presentation at an Australian Computer Society event under the intriguing headline PIRATE ON THE HIGH SEA, TEENAGERS’ SECRETS AND DR SEUSS – Big Data in the Enterprise. It was well structured and professionally delivered.
He talked about Big Data, which is not really a new term (it is often traced to a 2001 research report by analyst Doug Laney), although the power and allure of Big Data have only recently become a very popular topic of discussion. Mario reminded us that Doug Laney outlined three dimensions of data growth, the “3Vs”: volume (the increasing amount of data), velocity (the speed of data coming in and going out) and variety (the range of data types and sources).
Where does this growing data come from? From everywhere: online videos, images, comments and posts; transaction records (billing, invoicing and online banking); real-time data from sensors (such as GPS or weather data); and so on.
According to Mr. Bojilov, 90% of the world’s data was created in the last two and a half years. The growing volume speaks for itself: according to IBM, we create 2.5 quintillion bytes (2.5 exabytes) of data per day. But why has Big Data begun to have such a wide-reaching impact on our lives today?
Well, take for example Nate Silver’s celebrated record of accurately predicting the outcomes of elections and sporting events. His seven most memorable predictions are: the 2008 Presidential Election, the 2013 College Football Championship, the 2013 College Basketball Tournament, the 2010 Midterm Elections, the 2013 Academy Awards, the 2013 NFL Playoffs & Super Bowl and the 2012 Presidential Election. What method does he use to achieve such results? As far as I know, a poll-averaging method with some adjustments, along with PECOTA (Player Empirical Comparison and Optimization Test Algorithm), a sabermetric system he developed for forecasting baseball player performance. He nailed the analytics and demonstrated the potential of Big Data.
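To give a feel for what "poll averaging" can mean, here is a minimal sketch in Python. It is not Nate Silver's actual model, which is not described in the talk. It only assumes that each poll is weighted by its sample size, and the field names `share` and `sample_size` as well as the numbers are made up for illustration.

```python
def weighted_poll_average(polls):
    """Average the candidate's share across polls, weighting each poll
    by its sample size (an assumed weighting, for illustration only)."""
    total_weight = sum(poll["sample_size"] for poll in polls)
    if total_weight == 0:
        raise ValueError("at least one poll with a positive sample size is required")
    weighted_sum = sum(poll["share"] * poll["sample_size"] for poll in polls)
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Hypothetical polls: share is the candidate's support in percent.
    hypothetical_polls = [
        {"share": 50.0, "sample_size": 1000},
        {"share": 54.0, "sample_size": 3000},
    ]
    print(weighted_poll_average(hypothetical_polls))
```

A real forecasting model would add further adjustments (for example, for how recent each poll is), which is where the "some adjustments" part comes in.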
Of course, Nate Silver’s example is quite notable, but Mario Bojilov gave us different examples of how Big Data can benefit communities, government and enterprises, with staggering results in piracy prevention, reducing graffiti vandalism, predicting incidents and robberies, and identifying fraud.
Moving to the next aspect of Big Data, the speed of data processing (velocity), Mario emphasized the vital role data velocity plays in delivering competitive advantages for enterprises and social networks, particularly in customer satisfaction and identity fraud scenarios.
The third characteristic of Big Data is variety, which comes in different forms: structured (neatly modeled and organized data such as customer log files or product records), semi-structured (data with no strict formatting rules, like Telco’s Call Data Records) and unstructured (books, journals and multimedia). The challenge is to capture, organize and analyse such a wide variety of data coming from a variety of sources in a specific context.
Mr. Bojilov introduced a variety of popular solutions to tackle Big Data:
- Hadoop MapReduce, a programming model for large-scale data processing that is part of the Apache Hadoop software framework.
- Statistical Analysis System (SAS), developed by SAS Institute Inc., is a software suite that can obtain, amend, manage and gather data from different sources and analyse that data statistically. It is free for students and professors at universities.
- Weka (Waikato Environment for Knowledge Analysis) is free machine learning software for data analysis and predictive modeling. It was developed at the University of Waikato, New Zealand.
- KNIME is professional open-source software for data analytics and reporting.
- Orange is an open-source data mining and machine learning suite for data visualization and analysis.
- MongoDB is an open-source document database with full index support and document-oriented storage. According to Mario, data modeling is very easy with MongoDB, and many companies use it for different purposes: archiving, gaming, social analytics and content management.
- Cassandra, a highly scalable open-source NoSQL database management system (an Apache Software Foundation project), was originally developed at Facebook to power its Inbox Search.
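To make the MapReduce programming model from the first item a little more concrete, here is a small in-memory sketch in Python. It does not use Hadoop or any of its APIs. It only imitates the three conceptual steps (map, shuffle, reduce) on a classic word-count task, and the function names are my own choices for illustration.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one document."""
    for word in document.lower().split():
        yield word, 1


def shuffle_phase(pairs):
    """Shuffle step: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle and reduce in sequence over a list of documents."""
    pairs = (pair for document in documents for pair in map_phase(document))
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    docs = ["big data is big", "data velocity and data variety"]
    print(word_count(docs))
```

In a real cluster the map and reduce steps run in parallel on many machines, and the framework handles the shuffle between them. Here everything runs in one process, purely to show the shape of the model.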
Overall, Mario gave us insights into Big Data architecture and aspects of privacy; showed how to apply Benford’s law (the First-Digit Law) to see what falls outside the norm; and explained how to use different methods and approaches, and ask different questions, in order to make Big Data more relevant.
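For readers who have not met Benford’s law: in many naturally occurring sets of numbers, the leading digit d appears with probability log10(1 + 1/d), so about 30% of values start with 1 and under 5% start with 9. The sketch below, in Python, compares the observed leading digits of a data set with those expected frequencies. It is my own illustration rather than the procedure shown in the talk, and it uses the powers of 2 as stand-in data.

```python
import math
from collections import Counter


def benford_expected():
    """Expected frequency of each leading digit 1..9 under Benford's law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def leading_digit(value):
    """First significant digit of a non-zero number, via scientific notation."""
    if value == 0:
        raise ValueError("zero has no leading digit")
    return int(f"{abs(value):.16e}"[0])


def observed_frequencies(values):
    """Share of values starting with each digit 1..9 (zeros are skipped)."""
    digits = [leading_digit(v) for v in values if v != 0]
    if not digits:
        raise ValueError("no non-zero values to analyse")
    counts = Counter(digits)
    return {d: counts.get(d, 0) / len(digits) for d in range(1, 10)}


def deviations(values):
    """Observed minus expected frequency per digit; large gaps merit a closer look."""
    expected = benford_expected()
    observed = observed_frequencies(values)
    return {d: observed[d] - expected[d] for d in range(1, 10)}


if __name__ == "__main__":
    sample = [2 ** n for n in range(100)]  # stand-in data, not real records
    for digit, gap in deviations(sample).items():
        print(digit, round(gap, 3))
```

A large gap for some digit does not prove fraud. It only flags figures, such as invoice amounts, that may deserve a closer look.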
At the end of the talk, he revealed that Dr. Seuss is his favorite author and quoted:
“If you read with your eyes shut you’re likely to find
That the place where you’re going is far, far behind
SO…that’s why I tell you to keep your eyes wide.
Keep them wide open…at least on one side.”
Finally, he urged us to keep our eyes wide open and concluded: “Bigger data sets lead to more precise results.”
It was an excellent presentation!