Cloudera Search Now Generally Available for Open Source Users and Enterprise Subscribers

New Real-Time Search Subscription Add-On Allows Cloudera Customers to Get Maximum Value From the Industry's First Fully Integrated Search Solution for Hadoop

PALO ALTO, CA -- (Marketwired) -- 09/05/13 -- Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop™, today announced the general availability of Cloudera Search and the accompanying add-on RTS (Real-time Search) subscription. Cloudera Search is the industry's first fully integrated search engine for interactive exploration of data stored in the Hadoop Distributed File System (HDFS) and Apache HBase™. The RTS subscription enables customers to more effectively leverage Cloudera Search by providing technical support, legal indemnification and continual influence over the development of the open source project.

The Next Generation of Search for Enterprise Users of Hadoop
For years, databases attempted to provide search as a feature in their platforms but this approach was largely abandoned in favor of acquiring independent search products that require their own infrastructure, integration and expertise. Hadoop's flexibility makes it well suited for search, and consequently, a better general-purpose platform for data exploration than relational databases. Cloudera Search enables anyone within an organization to perform interactive, natural language keyword searches and faceted navigation without additional training or advanced programming knowledge, so both technical and non-technical business users can explore and analyze data in Hadoop.

Released as a public beta offering in June 2013, Cloudera Search was the first enterprise-ready search-on-Hadoop solution on the market. Since that time, the company has worked closely with enterprise customers, open source users and technology partners, rigorously testing and refining the platform in real-world applications to deliver today's production-hardened and customer-validated 1.0 release, designed from the ground up for mission-critical workloads.

Cloudera Search is specifically designed to support business users in quickly and efficiently locating relevant data stored in Hadoop for further processing and analysis and is fully integrated with the CDH platform. Key features include:

  • Scalable, Reliable Index Storage in HDFS: integrates index storage and serving directly into HDFS.
  • Batch Indexing via MapReduce: allows for index creation of data stored in HDFS and HBase with scalability and robustness comparable to MapReduce itself.
  • Real-time Indexing at Collection: makes events searchable as they are stored in HDFS and HBase through near real-time indexing features, powered by Apache Flume™ and the Lily HBase Indexer.
  • Easy Interaction and Data Exploration via Cloudera Hue: offers plug-in application and easy-to-install capabilities for standard Hue servers to query data and view result files, enabling faceted exploration.
  • Simplified Field Extraction and Cross-Platform Data Processing: enables quick and easy field extraction of any data stored in HDFS using optimized Hadoop file formats, such as Apache Avro™. The new processing framework, Cloudera Morphlines, promotes reusable configurations and processing activities, so users can avoid the pain that many standalone search solutions impose.
  • Unified Management and Monitoring with Cloudera Manager: provides a centralized management and monitoring experience that makes it as easy to deploy, configure, and monitor search services as it is to manage CDH deployments and other services on the Hadoop cluster.
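
To make the keyword search and faceted exploration described in the features above more concrete, here is a minimal sketch of what such a query might look like over HTTP. It assumes the search service exposes a standard Solr-style `select` endpoint (the release names Solr but does not document endpoints). The host, collection name (`logs`) and field name (`hostname`) are hypothetical, and the helper functions are illustrative, not part of any Cloudera API.

```python
from urllib.parse import urlencode


def build_facet_query_url(base_url, collection, keywords, facet_field, rows=10):
    """Build a Solr-style select URL for a keyword search with one facet field.

    base_url, collection and facet_field are placeholders chosen for this
    sketch; q, rows, wt, facet and facet.field are standard Solr parameters.
    """
    params = {
        "q": keywords,               # free-text keyword query
        "rows": rows,                # number of matching documents to return
        "wt": "json",                # ask for a JSON response
        "facet": "true",             # turn faceting on
        "facet.field": facet_field,  # field whose values are counted
    }
    return f"{base_url}/{collection}/select?{urlencode(params)}"


def facet_counts(response, facet_field):
    """Convert Solr's default flat facet list [value, count, ...] to a dict."""
    flat = response["facet_counts"]["facet_fields"][facet_field]
    return dict(zip(flat[0::2], flat[1::2]))


# Hypothetical usage: the URL is only built here, not fetched.
url = build_facet_query_url(
    "http://localhost:8983/solr", "logs", "disk failure", "hostname"
)
```

The facet counts returned alongside the hits are what let a user interface offer drill-down navigation (for example, narrowing results to a single host) without the user writing any further queries.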

Maximize the Value of Cloudera Search with an RTS Subscription
The RTS (Real-time Search) subscription is the best way to leverage the power of Cloudera Search, offering technical support, legal indemnification and continual influence over the development of the open source project. With an RTS subscription, customers can get up and running more quickly, resolve issues more effectively and ensure that the technology remains in alignment with the strategic objectives of their big data deployment.

"As enterprise Hadoop deployments continue to mature, becoming primary repositories for more and more types of data, the center of gravity for data management continues to make a meaningful shift toward Hadoop," said Charles Zedlewski, vice president, Products, Cloudera. "We've taken what was once a relatively complicated and involved freestanding system, requiring its own hardware and operational model, and turned it into a feature of a larger, more ubiquitous open source platform -- CDH. We believe this integrated approach represents a big step forward for users of both Solr and Hadoop. With Cloudera Search, Hadoop deployments can now be explored with the same ease of use and speed as a simple Google search engine query, empowering our customers to achieve rapid insights from a fully integrated platform."

"For too long, the power of data has been available only to technical users in the enterprise. To fully unlock the potential of Hadoop, data needs to be available and consumable by workers beyond IT and across the organization," said Justin Langseth, chief executive officer and founder, Zoomdata. "Through our integration with Impala and Cloudera Search, Zoomdata customers can turn big datasets and streams into compelling, interactive visualizations that make information accessible to anyone. Our partnership with Cloudera gives our mutual customers the ability to see, analyze and explore their data in real time and put it to work, regardless of their technical skill level. Everyday business users now have the power to find information and perform analytics on billions of rows of raw data from almost any device without the need for additional training."

Learn More: Zoomdata Leverages Cloudera Enterprise RTS to Democratize Hadoop Search for Its Customers
Watch a video demo from Cloudera partner Zoomdata to see how the company has integrated its solution with Cloudera Search to simplify the creation of data visualizations: http://youtu.be/yALNUmicadg

Product Availability
Cloudera Search 1.0 is immediately available to open source users and can be downloaded for free at www.cloudera.com/downloads. Cloudera RTS is immediately available to Cloudera Enterprise subscribers, as a supplemental module. For more information, visit www.cloudera.com/search.

About Cloudera
Founded in 2008, Cloudera pioneered the business case for Hadoop with CDH: the world's most comprehensive, thoroughly tested and widely deployed 100% open source distribution of Apache Hadoop in both commercial and non-commercial environments. Now, the company is redefining data management with its Platform for Big Data, Cloudera Enterprise, empowering enterprises to Ask Bigger Questions™ and gain rich, actionable insights from all their data, to quickly and easily derive real business value that translates into competitive advantage. As the top contributor to the Apache open source community and leading educator of data professionals with the broadest array of Hadoop training and certification programs, Cloudera also offers comprehensive consulting services. Over 700 partners across hardware, software and services have teamed with Cloudera to help meet organizations' big data goals. With tens of thousands of nodes under management and hundreds of customers across diverse markets, Cloudera is the category leader that has set the standard for Hadoop in the enterprise. www.cloudera.com

Connect with Cloudera
Read the blog: http://www.cloudera.com/blog/
Follow on Twitter: http://twitter.com/cloudera
Visit on Facebook: http://www.facebook.com/cloudera

Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any form of copying other than for an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.
