Our website makes use of cookies like most of the websites. In order to deliver a personalised, responsive and improved experience, we remember and store information about how you use it. This is done using simple text files called cookies which sit on your computer. These cookies are completely safe and secure and will never contain any sensitive information. By clicking continue here, you give your consent to the use of cookies by our website.

Thursday, 04 June 2015 13:02

BlueData supports Hadoop and Spark on Docker containers

Written by 

New features in BlueData allows users to spin up big data clusters in minutes on their laptop


Infrastructure software platform for Big Data, BlueData has announced support for Docker containers.  BlueData’s mission is to make it easier to deploy Big Data infrastructure on-premises, and has opted to support Docker’s open source container technology. With the new BlueData EPIC platform running on Docker, organisations will be able to deploy Apache Hadoop or Apache Spark in a lightweight container environment.  Enabling users to download BlueData EPIC Lite – a free version of the EPIC platform available immediately – and spin up virtual Hadoop or Spark clusters in Docker containers on their laptop.

“Container technology is disrupting the IT market, and Docker is seeing rapid enterprise adoption,” said Kumar Sreekanti, co-founder and CEO of BlueData. “There are now thousands of applications running on Docker, but until today there were only a few applications supporting containers for Big Data analytics.  Together with Docker, we’re disrupting the Big Data market and leveraging the power of containers for enterprises deploying Hadoop and Spark on-premises.”

The BlueData EPIC software platform works with all of the major Hadoop distributions as well as Spark and integrates with the leading analytical applications, and runs with any shared storage environment.

BlueData is working with Docker as well as Intel and an ecosystem of partners to ensure enterprise-class security, performance, and scalability for Big Data applications running in containers.  In collaboration with these partners, BlueData will provide enterprises with an on-premises, multi-tenant solution to run large-scale data processing environments such as Hadoop or Spark on Docker. 

“Intel works closely with the Apache Hadoop and Apache Spark communities and their ecosystems to drive the foundation for Big Data and analytics in the enterprise,” said Michael Greene, Intel Vice President and General Manager of System Technologies and Optimization in Intel’s Software and Services Group. “With BlueData’s support for container technology to enable easier deployment of Hadoop and Spark, we believe BlueData can deliver even greater simplicity and agility in a virtualized environment – while providing the security and performance that enterprise IT organizations require for their Big Data infrastructure.”

The initial BlueData software release supporting Docker is available today as a free edition: BlueData EPIC Lite. With EPIC Lite users can create multi-node Hadoop clusters (including key components such as Hive, Hue, Impala, and Pig) or standalone Spark clusters running in Docker containers. They can point to data in their local files or against existing HDFS and NFS storage.  Within a matter of minutes, they can develop and test Big Data analytics in a personal sandbox on their laptop. 

EPIC Lite is available for download on a developer laptop or as a hosted instance on the Amazon Web Services Elastic Compute Cloud (EC2). Docker support for the enterprise edition of the BlueData EPIC software platform will be generally available for production deployments in the fall of 2015.  In a separate announcement today, BlueData also introduced the new summer release of the EPIC software platform.

Leave a comment

Make sure you enter the (*) required information where indicated. HTML code is not allowed.



255x635 banner2-compressed