Agile Computing Authors: Elizabeth White, Carmen Gonzalez, John Mertic, Pat Romanski, Liz McMillan

Blog Feed Post

Cloudera Strengthens Hadoop Security with Acquisition of Gazzang: Builds on additional community efforts to deliver end-to-end security offering


One thing I really love about being in the technology field is watching things get done that just a short while ago seemed impossible. I felt that way again when reading the press release below.  In the early days of production systems built around Apache Hadoop, security was only possible by limiting access to your cluster. Later, more and more security related capabilities were added, including better access control, authentication, auditing, and data provenance. Many players delivered niche solutions for encrypting data, but not so long ago most solutions I saw introduced new weaknesses for each solution.  Then some very positive things started happening.  One is Intel corporation started a deep focus on enhanced security, including creating an open source community activity that leveraged smart design that could leverage Intel Data Protection Technology with AES-NI (Project Rhino) in 2013. Cloudera continued to focus on security and find-grain access control with capabilities like Sentry.  Another very positive development was the application of engineering and security talent by an amazing firm named Gazzang. One of the big advances from Gazzang: well engineered key management.

The news below is the product of many of these factors plus the vision and leadership of very smart people at Gazzang, Intel and Cloudera. The result– something that was absolutely impossible just a few years ago, is now achievable. Security still takes forethought, but the fact that well engineered end to end encryption is now possible is a dramatically positive step.

From: http://ctolink.us/Tbddag

Cloudera Strengthens Hadoop Security with Acquisition of Gazzang


Combines Apache Sentry and Intel’s Project Rhino with Gazzang’s Encryption and Key Management to Build the Industry’s Most Robust End-to-End Security Offering for Hadoop Environments

PALO ALTO, Calif. – June 3, 2014 – Cloudera, a leader in enterprise analytic data management powered by Apache Hadoop™, today announced that it has acquired Gazzang, the big data security experts, to dramatically strengthen its security offerings, building on the roadmap laid out last year when Cloudera first delivered Sentry. Terms of the deal were not disclosed.

The addition will immediately deliver enterprise-grade data encryption and key management, addressing head on the challenges associated with securing and processing sensitive and legally protected data within the Hadoop ecosystem. Thus fulfilling a requirement in myriad compliance regulations like HIPAA-HITECH, PCI-DSS, FERPA and the EU Data Protection Directive.

While Cloudera customers will continue to have a choice of a broad range of cross-platform data protection methods available from Cloudera partners, Cloudera now offers encryption for all data-at-rest stored inside the Hadoop cluster – using an approach that is transparent to applications using the data, thereby minimizing the costs associated with enabling encryption.

Cloudera plans to focus the efforts of the Gazzang team on additional security challenges in Hadoop. The team will become the heart of the Cloudera Center for Security Excellence focusing exclusively on Hadoop security. The Center will focus on:

    • Comprehensive data and cluster security technologies - including “follow the data” authorization and encryption policies riding on Cloudera’s data lineage tracking capabilities.
    • Security testing and certification - including continuous vulnerability assessment, performance optimization, and developing regulatory compliance playbooks.
    • Security ecosystem partner enablement - developing security integration APIs and certifying partner products.

In addition to immediately providing a transparent data-at-rest encryption and key management solution to enterprise customers – addressing one of the biggest gaps in Hadoop security – Cloudera, Intel and Gazzang form a powerful team of big data security and silicon performance optimization expertise that will improve security in core Hadoop through the open source community.

Cloudera is continuing to invest broadly in the open source community to support and accelerate security features into project Rhino—an open source effort founded by Intel in early 2013. Project Rhino is a broad based open source security architecture addressing many of the major pillars of enterprise security including: perimeter security, entitlements and access control and data protection.

“Data security is no longer a checkbox for IT organizations or operations departments, it has become a top business priority,” said Tom Reilly, chief executive officer, Cloudera. “At the same time compliance requirements for protecting data continue to expand in scope where data access comes under scrutiny. We’re entering a whole new era with the rise of the Industrial Internet and the Internet of Things where there is vastly more data being streamed from billions of devices. Centralizing and accessing that net-new data to unlock its value is therefore a challenge when you consider the security requirements. That’s what we’re solving now.”

Simplifying the process of injecting core security features such as encryption and key management into highly scalable environments will enable customers to move beyond test and development workloads to real-world implementations much more quickly and easily. For example, companies that are weighing the value of putting workloads in public cloud environments against security concerns will now be able to move forward by putting in place additional process-based access controls. This limits access to encrypted data only to authorized system functions – rather than specific users or roles – so a cloud administrator, who likely does not need access to the sensitive encrypted data, cannot run commands that grant them access. This is critical for compliance initiatives that require organizations to restrict data access based on “business need to know.”

“Enterprises are adopting big data solutions, despite what some mainstream press has stated, but only when they can address data security and compliance requirements. That Cloudera can now address the enterprise’s most critical security requirement — data encryption — directly into the platform is a big win for security-sensitive customers,” said Adrian Lane of the analyst firm Securosis. “What’s more, Gazzang’s transparent form of encryption scales right along with NoSQL clusters, so Cloudera customers get data security at big data scale. This is an astute acquisition by Cloudera.”

Today a rapidly growing number of large enterprises are building enterprise data hubs built on Hadoop to address a wide variety of data challenges and increasingly to work with data in more ways, not only for processing and archiving, but now for self-service BI and advanced analytics. The success of Hadoop has also drawn the attention of big, established players in the market, including most leading enterprise software companies. Many with decades of experience serving large and demanding customers now are building out software and systems that incorporate Hadoop.

Cloudera has driven enterprise capabilities and more power into the Hadoop platform than any other company as evidenced by the incorporation of real- time query with its open source Cloudera Impala; real-time search support with Lucene and Solr; security with Cloudera’s Apache Sentry project; integrated governance, compliance, reporting and disaster recovery—all on to the Hadoop platform.

Cloudera plans to incorporate Gazzang’s technology into its Cloudera Enterprise offering. Existing customers will benefit immediately as the new products become part of the company’s existing offering. Cloudera will provide support for the Gazzang customer base.


About Gazzang

Gazzang provides data security solutions and expertise to help enterprises protect sensitive information and maintain performance in big data and cloud environments. Our technology enables SaaS vendors, health care organizations, financial institutions, public sector agencies and more to meet regulatory compliance initiatives, secure personally identifiable information and prevent unauthorized access to sensitive data and systems. The company is headquartered in Austin, Texas and backed by Austin Ventures and Silver Creek Ventures.

About Cloudera

Cloudera is revolutionizing enterprise data management by offering the first unified Platform for Big Data, an enterprise data hub built on Apache Hadoop™. Cloudera offers enterprises one place to store, process and analyze all their data, empowering them to extend the value of existing investments while enabling fundamental new ways to derive value from their data. Only Cloudera offers everything needed on a journey to an enterprise data hub, including software for business critical data challenges such as storage, access, management, analysis, security and search. As the leading educator of Hadoop professionals, Cloudera has trained over 22,000 individuals worldwide. Over 1,000 partners and a seasoned professional services team help deliver greater time to value. Finally, only Cloudera provides proactive and predictive support to run an enterprise data hub with confidence. Leading organizations in every industry plus top public sector organizations globally run Cloudera in production.www.cloudera.com

Connect with Cloudera

Read our blogs: http://www.cloudera.com/blog/ andhttp://vision.cloudera.com/

Follow us on Twitter:http://twitter.com/cloudera

Visit us on Facebook:http://www.facebook.com/cloudera

Cloudera, Cloudera’s Platform for Big Data, Cloudera Enterprise Data Hub Edition, Cloudera Enterprise Flex Edition, Cloudera Enterprise Basic Edition and CDH are trademarks or registered trademarks of Cloudera Inc. in the United States, and in jurisdictions throughout the world. All other company and product names may be trade names or trademarks of their respective owners.

Read the original blog entry...

More Stories By Bob Gourley

Bob Gourley writes on enterprise IT. He is a founder and partner at Cognitio Corp and publsher of CTOvision.com

@ThingsExpo Stories
More and more brands have jumped on the IoT bandwagon. We have an excess of wearables – activity trackers, smartwatches, smart glasses and sneakers, and more that track seemingly endless datapoints. However, most consumers have no idea what “IoT” means. Creating more wearables that track data shouldn't be the aim of brands; delivering meaningful, tangible relevance to their users should be. We're in a period in which the IoT pendulum is still swinging. Initially, it swung toward "smart for smar...
Virgil consists of an open-source encryption library, which implements Cryptographic Message Syntax (CMS) and Elliptic Curve Integrated Encryption Scheme (ECIES) (including RSA schema), a Key Management API, and a cloud-based Key Management Service (Virgil Keys). The Virgil Keys Service consists of a public key service and a private key escrow service. 

The Internet of Things (IoT), in all its myriad manifestations, has great potential. Much of that potential comes from the evolving data management and analytic (DMA) technologies and processes that allow us to gain insight from all of the IoT data that can be generated and gathered. This potential may never be met as those data sets are tied to specific industry verticals and single markets, with no clear way to use IoT data and sensor analytics to fulfill the hype being given the IoT today.
Data is the fuel that drives the machine learning algorithmic engines and ultimately provides the business value. In his session at Cloud Expo, Ed Featherston, a director and senior enterprise architect at Collaborative Consulting, will discuss the key considerations around quality, volume, timeliness, and pedigree that must be dealt with in order to properly fuel that engine.
What happens when the different parts of a vehicle become smarter than the vehicle itself? As we move toward the era of smart everything, hundreds of entities in a vehicle that communicate with each other, the vehicle and external systems create a need for identity orchestration so that all entities work as a conglomerate. Much like an orchestra without a conductor, without the ability to secure, control, and connect the link between a vehicle’s head unit, devices, and systems and to manage the ...
The best way to leverage your Cloud Expo presence as a sponsor and exhibitor is to plan your news announcements around our events. The press covering Cloud Expo and @ThingsExpo will have access to these releases and will amplify your news announcements. More than two dozen Cloud companies either set deals at our shows or have announced their mergers and acquisitions at Cloud Expo. Product announcements during our show provide your company with the most reach through our targeted audiences.
Machine Learning helps make complex systems more efficient. By applying advanced Machine Learning techniques such as Cognitive Fingerprinting, wind project operators can utilize these tools to learn from collected data, detect regular patterns, and optimize their own operations. In his session at 18th Cloud Expo, Stuart Gillen, Director of Business Development at SparkCognition, discussed how research has demonstrated the value of Machine Learning in delivering next generation analytics to impr...
@ThingsExpo has been named the Top 5 Most Influential Internet of Things Brand by Onalytica in the ‘The Internet of Things Landscape 2015: Top 100 Individuals and Brands.' Onalytica analyzed Twitter conversations around the #IoT debate to uncover the most influential brands and individuals driving the conversation. Onalytica captured data from 56,224 users. The PageRank based methodology they use to extract influencers on a particular topic (tweets mentioning #InternetofThings or #IoT in this ...
For basic one-to-one voice or video calling solutions, WebRTC has proven to be a very powerful technology. Although WebRTC’s core functionality is to provide secure, real-time p2p media streaming, leveraging native platform features and server-side components brings up new communication capabilities for web and native mobile applications, allowing for advanced multi-user use cases such as video broadcasting, conferencing, and media recording.
Amazon has gradually rolled out parts of its IoT offerings, but these are just the tip of the iceberg. In addition to optimizing their backend AWS offerings, Amazon is laying the ground work to be a major force in IoT - especially in the connected home and office. In his session at @ThingsExpo, Chris Kocher, founder and managing director of Grey Heron, explained how Amazon is extending its reach to become a major force in IoT by building on its dominant cloud IoT platform, its Dash Button strat...
SYS-CON Events announced today that SoftNet Solutions will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. SoftNet Solutions specializes in Enterprise Solutions for Hadoop and Big Data. It offers customers the most open, robust, and value-conscious portfolio of solutions, services, and tools for the shortest route to success with Big Data. The unique differentiator is the ability to architect and ...
A critical component of any IoT project is what to do with all the data being generated. This data needs to be captured, processed, structured, and stored in a way to facilitate different kinds of queries. Traditional data warehouse and analytical systems are mature technologies that can be used to handle certain kinds of queries, but they are not always well suited to many problems, particularly when there is a need for real-time insights.
DevOps is being widely accepted (if not fully adopted) as essential in enterprise IT. But as Enterprise DevOps gains maturity, expands scope, and increases velocity, the need for data-driven decisions across teams becomes more acute. DevOps teams in any modern business must wrangle the ‘digital exhaust’ from the delivery toolchain, "pervasive" and "cognitive" computing, APIs and services, mobile devices and applications, the Internet of Things, and now even blockchain. In this power panel at @...
One of biggest questions about Big Data is “How do we harness all that information for business use quickly and effectively?” Geographic Information Systems (GIS) or spatial technology is about more than making maps, but adding critical context and meaning to data of all types, coming from all different channels – even sensors. In his session at @ThingsExpo, William (Bill) Meehan, director of utility solutions for Esri, will take a closer look at the current state of spatial technology and ar...
Everyone knows that truly innovative companies learn as they go along, pushing boundaries in response to market changes and demands. What's more of a mystery is how to balance innovation on a fresh platform built from scratch with the legacy tech stack, product suite and customers that continue to serve as the business' foundation. In his General Session at 19th Cloud Expo, Michael Chambliss, Head of Engineering at ReadyTalk, will discuss why and how ReadyTalk diverted from healthy revenue an...
You have great SaaS business app ideas. You want to turn your idea quickly into a functional and engaging proof of concept. You need to be able to modify it to meet customers' needs, and you need to deliver a complete and secure SaaS application. How could you achieve all the above and yet avoid unforeseen IT requirements that add unnecessary cost and complexity? You also want your app to be responsive in any device at any time. In his session at 19th Cloud Expo, Mark Allen, General Manager of...
SYS-CON Media announced today that @WebRTCSummit Blog, the largest WebRTC resource in the world, has been launched. @WebRTCSummit Blog offers top articles, news stories, and blog posts from the world's well-known experts and guarantees better exposure for its authors than any other publication. @WebRTCSummit Blog can be bookmarked ▸ Here @WebRTCSummit conference site can be bookmarked ▸ Here
SYS-CON Events announced today that Streamlyzer will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Streamlyzer is a powerful analytics for video streaming service that enables video streaming providers to monitor and analyze QoE (Quality-of-Experience) from end-user devices in real time.
In past @ThingsExpo presentations, Joseph di Paolantonio has explored how various Internet of Things (IoT) and data management and analytics (DMA) solution spaces will come together as sensor analytics ecosystems. This year, in his session at @ThingsExpo, Joseph di Paolantonio from DataArchon, will be adding the numerous Transportation areas, from autonomous vehicles to “Uber for containers.” While IoT data in any one area of Transportation will have a huge impact in that area, combining sensor...
Almost everyone sees the potential of Internet of Things but how can businesses truly unlock that potential. The key will be in the ability to discover business insight in the midst of an ocean of Big Data generated from billions of embedded devices via Systems of Discover. Businesses will also need to ensure that they can sustain that insight by leveraging the cloud for global reach, scale and elasticity.