Stuff the internet says about data for October 19th - October 26th, 2014

This week has been a very eventful week in the data space. Google is scoping up Firebase and two UK based machine learning companies. We guess that these mergers and acquisitions are just the tip of the iceberg on what to come over the next few years. This week also a professor wrote a quite nice summary of the littereature about Facebooks data architecture and how they scale their peta sized data-sets. Very interesing read +++

This weeks stories and links:

Persistence

15 timeless data science articles http://www.datasciencecentral.com/m/blogpost?id=6448529%3ABlogPost%3A216421 #bigdata

Replaces Oracle with Cassandra to handle a 600% increase in data at an eighth of the price http://planetcassandra.org/blog/interview/iovation-replaces-oracle-with-cassandra-to-handle-a-600-increase-in-data-at-an-eighth-of-the-price/ #nosql

@astensby has been on a little roadshow presenting Neo4j. http://blog.monokkel.io/the-neo4j-roadshow/ (Slides in Norwegian)

New Book on Time Series http://www.datasciencecentral.com/m/group/discussion?id=6448529%3ATopic%3A216325 @DataScienceCtrl #nosql

Switching From MongoDB to Neo4j: http://java.dzone.com/articles/switching-mongodb-neo4j #nosql

Using Cassandra as a queue (not recommended): http://lostechies.com/ryansvihla/2014/10/20/domain-modeling-around-deletes-or-using-cassandra-as-a-queue-even-when-you-know-better/

Firebase is Joining Google! https://www.firebase.com/blog/2014-10-21-firebase-joins-google.html

How Facebook tiers their #BIGData storage and more... http://muratbuffalo.blogspot.no/2014/10/facebooks-software-architecture.html?m=1 via @muratdemirbas

Processing

RT @analyticbridge: Why Zipf's law explains so many #bigdata and physics phenomena: http://t.co/hdd43PcfcZ

Model Selection Tips From Competitive Machine Learning http://machinelearningmastery.com/model-selection-tips-from-competitive-machine-learning/ #bigdata

LinkedIn and Twitter Contribute Machine Learning Libraries to Open Source http://www.infoq.com/news/2014/10/LinkedIn-Twitter-ML-Open-Source

5 Reasons Organizations Use Hadoop http://smartdatacollective.com/bigdatastartups/277766/what-hadoop-and-five-reasons-organisations-use-hadoop-infographic

Musicians Play Moneyball: #BigData Revolutionizes Another Industry http://www.forbes.com/sites/abrambrown/2014/10/20/musicians-play-moneyball-data-revolutionizes-another-industry/ via @Forbes

Hortonworks adds object storage and spiffier query language to Hadoop http://siliconangle.com/blog/2014/10/17/hortonworks-adds-object-storage-and-spiffier-query-language-to-hadoop/ #bigdata

Google buys two more UK artificial intelligence startups http://www.theguardian.com/technology/2014/oct/23/google-uk-artificial-intelligence-startups-machine-learning-dark-blue-labs-vision-factory

Presentation

The Variable Tree: Anatomy of an Emerging Knowledge Network: The Zapnito Graph Vizualized: http://vartree.blogspot.com/2014/10/anatomy-of-emerging-knowledge-network.html

25 Maps that describe america: http://mentalfloss.com/article/59646/25-maps-describe-america #visualization

Small multiple maps using d3 http://blog.webkid.io/multiple-maps-d3/ #visualization

Author

Tarjei Romtveit

Co-founder of Monokkel with solid experience in systems design, data management, data analysis, software development and agile processes.