Hadoop

Skymind Global Ventures launches $800M fund and London office to back AI startups

Skymind Global Ventures (SGV) appeared last year in Asia/UK as a vehicle for the previous founders of a YC-backed open-source AI platform to invest in companies that used the platform. Today it announ

Starburst raises $22M to modernize data analytics with Presto

Starburst, the company that’s looking to monetize the open-source Presto distributed query engine, today announced that it has raised a $22 million funding round led by Index Ventures, with the

Datameer announces $40M investment as it pivots away from Hadoop roots

Datameer, the company that was born as a data prep startup on top of the open-source Hadoop project, announced a $40 million investment and a big pivot away from Hadoop, while staying true to its big

Databricks brings its Delta Lake project to the Linux Foundation

Databricks, the big data analytics service founded by the original developers of Apache Spark, today announced that it is bringing its Delta Lake open-source project for building data lakes to the Lin

Google brings Cloud Dataproc to Kubernetes

Cloud Dataproc is probably one of the lesser-known products in Google Cloud’s portfolio, but it’s a powerful tool for data wranglers who are looking for a fully managed cloud service that lets the

With MapR fire sale, Hadoop’s promise has fallen on hard times

If you go back about a decade, Hadoop was hot and getting hotter. It was a platform for processing big data, just as big data was emerging from the domain of a few web-scale companies to one where eve

Qubole launches Quantum, its serverless database engine

Qubole, the data platform founded by Apache Hive creator and former head of Facebook’s Data Infrastructure team Ashish Thusoo, today announced the launch of Quantum, its first serverless offerin

Cloudera and Hortonworks finalize their merger

Cloudera and Hortonworks, two of the biggest players in the Hadoop big data space, today announced that they have finalized their all-stock merger. The new company will use the Cloudera brand and will

Cloudera and Hortonworks announce $5.2 billion merger

Over the years, Hadoop, the once high-flying open-source platform, gave rise to many companies and an ecosystem of vendors emerged. It was long believed that some major companies would emerge from the

Investors place $25M on AtScale to get the big picture of big data

AtScale, a four-year old startup that helps companies get a big-picture view of their big data inside their BI tools, announced a $25 million Series C investment today. The round was led by Atlantic B

Databricks releases serverless platform for Apache Spark along with new library supporting deep learning

Today to kick off Spark Summit, Databricks announced a Serverless Platform for Apache Spark — welcome news for developers looking to reduce time spent on cluster management. The move to simplify d

Cloudera’s IPO will test unicorn valuations

TL;DR: Cloudera's recent IPO filing shows a company with steep losses and rapid revenue growth. Today we'll examine Cloudera's finances and where it fits into the current IPO universe. Why do we care?

Cloudera finally ready for the public stage

When I first met Cloudera CEO Tom Reilly in 2015 at the Intel Capital Summit, we were about to go onstage for a fireside chat to discuss, among other things, Intel's massive investment in his company.

Yahoo supercharges TensorFlow with Apache Spark

Yahoo, model Apache Spark citizen and developer of CaffeOnSpark, which made it easier for developers building deep learning models in Caffe to scale with parallel processing, is open sourcing a new

MXNet accepted to the Apache Incubator

Talend looks to ease big data prep with latest release

Talend, the big data integration vendor that went public last July, announced its winter release today with new tools to help automate data preparation, a sticky problem for enterprise customers. Su

IBM releases DataWorks to give enterprise data a home and a brain

While the gears of research are turning fast developing new methods of machine intelligence, another, perhaps more impactful, trend is brewing in the field. Open source frameworks like Apache Spark

Latest Amazon Elastic MapReduce release supports 16 Hadoop projects

Amazon announced the release of Elastic MapReduce (EMR) 5.0.0 today, which includes, among other things, support for 16 open source Hadoop projects. As AWS continues to hone its various tools to help

Spark fragmentation undermines community

Today the Hadoop distribution war comes down to a final battle between Cloudera’s CDH and Hortonworks’ HDP. That wasn’t always the case. At the peak of the market’s fragmentation, numerous com

Microsoft bets on Apache Spark to power its big data and analytics services

Microsoft today announced that it is making a serious commitment to the open source Apache Spark cluster computing framework. After dipping its toes into the Spark ecosystem last year, the company to
Load More