“Big Data” is currently a big hype. Large amounts of historical data are stored in Hadoop or other platforms. Business Intelligence tools and statistical computing are used to draw new knowledge and to find patterns from this data, for example for promotions, cross-selling or fraud detection. The key challenge is how these findings can be […]

Read more

With Amazon Web Service’s re:Invent 2015 conference taking place this week, there’s been a steady stream of cloudy news releases, with new additions from both Elastic (formerly ElasticSearch) and MariaDB cropping up in the feeds. The Elastic collaboration comes in the form of a new Amazon Elasticsearch Service (Amazon ES for short). Developers can quickly […]

Read more

Couchbase is an open source, document-oriented NoSQL database for modern web, mobile, and IoT applications. It is designed for ease of development and Internet-scale performance. Couchbase 4.0 is a major release of the NoSQL database server  that includes significant advances in both architecture and features. Download Couchbase Server 4.0 now! The key features introduced in this version are: […]

Read more

Key-value NoSQL people Basho have been  steadfast in their quest to expand the database reach in the enterprise over the past 12 months. Having launched the Basho Data Platform earlier this year – an amalgamation of existing Riak technologies, including Riak KV (for key value), Riak CS cloud storage, Apache Spark cluster framework, Redis caching, […]

Read more

Following up from last week’s story, this is Part 2 of Adrian Colyer’s journey to get out of the fire swamp when it comes to data persistence and why these apps seem to be breaking down more and more. This article appeared originally on Adrian’s blog. Peering into the mist In Part I we examined […]

Read more

In this article, Sunila Gollapudi, author of Practical Machine Learning, introduces the key aspects of machine learning semantics and various toolkit options in Python. Machine learning has been around for many years now and all of us, at some point in time, have been consumers of machine learning technology. One of the most common examples […]

Read more

With release of Spark 1.5, with better performance, usability, and operational stability, many predicted a further droop in MapReduce mind share. And it seems those dark mutterings may have had a kernel of truth. A survey released today by Spark warders Databricks – the company founded by the creators of the hot young data processing technology […]

Read more

Appearing originally on his blog, Adrian Colyer embarks on a journey to get out of the fire swamp when it comes to data persistence and why these apps seem to be breaking down more and more. (*) with apologies to Moseley, Marks, and Westley. Something a little different to the regular paper reviews for the […]

Read more

In this article by Robert Craig Layton, author of Learning Data Mining with Python, we will look at predicting the winner of games of the National Basketball Association (NBA) using a different type of classification algorithm—decision trees. Collecting the data The data we will be using is the match history data for the NBA, for […]

Read more

I came across this post by Marko Karppinen, CEO and Founder of Richie. If you only read one part, it should be this quote: “You can’t expect the value of data to just appear out of thin air. Data isn’t fissile material. It doesn’t spontaneously reach critical mass and start producing insights.” By Marko Karppinen If […]

Read more