Stuff the internet says about data for October 10th - October 18th, 2014

Some quite interesting observations this week are that it seems MongoDB is finding it difficult to live in companies that needs scaling and flexible analytics capability. The overall trend seem to be that persistence technologies that focus on scale like Cloudera, Cassandra and HBase are complimented by analytical engines like Neo4J, ElasticSearch, Spark, Storm etc. The Swiss army knife is maybe not suitable for building skyscrapers after all.

As usual there are plenty of reminders that data is important and that organizations that does not embrace the data field is doomed. Some marginal discussions also cover methodical and organizational approaches to get the grip of the data field. This is in many cases more interesting than throwing a tool at the problem and hope for magical solutions. It will be very interesting to see what the future brings in this area.


Cassandra error handling done right: #nosql via @DataStax

If you are write heavy. Never shard on date or incrementing ids: #nosql via @dzone

Which companies have moved away from MongoDB and why? via @Quora

Apache Kafka Integration #bigdata #nosql via @cloudera

Getting Started with Time Series Data Modeling #nosql #cassandra @PlanetCassandra

Facebook's greatest technical accomplishments: Consistency across data centers ++ via @Quora #bigdata


Linkedin opens the Economic Graph challenge

12 things I hate about Hadoop via @infoworld

".. seek out only the data you need to address it and apply sophisticated predictive and prescriptive analytics" via @Data_Informed

RT @foundsays: Elasticsearch from the Top Down - New article by @alexbrasetvik
Go grab a coffee and put on your diving googles!

RT @jboner: Wired on the #Spark world record: 'Startup Crunches 100 Terabytes of Data in a Record 23 Minutes':

Mobile Video Big Data Architecture with Spring XD/Hadoop/HAWQ/Redis: Measuring Live Usage

RT @monowai: #Ebola tweets via @FlockDataCom. Thanks @halffinn. #NLP co-occur relationships vized in #D3js via #Neo4j

RT @KirkDBorne: As #IoT looms, survey finds growing urgency among companies to adopt #BigData and Predictive #Analytics


Use data or be data: #bigdata via @BigDataStartups

The Future of Graph Visualization (would have been great to be there!)

Facebook's #bigdata mistake (but honestly: You need to test your data assumptions) via @rakeshlobster

Using Data for Good: Jake Porway Talk: #bigdata

Moneyball: How businesses are using data to outsmart their rivals

Never judge a visualization by its bubbles:


Tarjei Romtveit

Co-founder of Monokkel with solid experience in systems design, data management, data analysis, software development and agile processes.