Click here to close now.

Welcome!

Web 2.0 Authors: Trevor Parsons, Dana Gardner, Ian Khan, Plutora Blog, Lori MacVittie

News Feed Item

Cloudera Search Now Generally Available for Open Source Users and Enterprise Subscribers

New Real-Time Search Subscription Add-On Allows Cloudera Customers to Get Maximum Value From the Industry's First Fully Integrated Search Solution for Hadoop

PALO ALTO, CA -- (Marketwired) -- 09/05/13 -- Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop™, today announced the general availability of Cloudera Search and the accompanying add-on RTS (Real-time Search) subscription. Cloudera Search is the industry's first fully integrated search engine for interactive exploration of data stored in the Hadoop Distributed File System (HDFS) and Apache HBase™. The RTS subscription enables customers to more effectively leverage Cloudera Search by providing technical support, legal indemnification and continual influence over the development of the open source project.

The Next Generation of Search for Enterprise Users of Hadoop
For years, databases attempted to provide search as a feature in their platforms but this approach was largely abandoned in favor of acquiring independent search products that require their own infrastructure, integration and expertise. Hadoop's flexibility makes it well suited for search, and consequently, a better general-purpose platform for data exploration than relational databases. Cloudera Search enables anyone within an organization to perform interactive, natural language keyword searches and faceted navigation without additional training or advanced programming knowledge, so both technical and non-technical business users can explore and analyze data in Hadoop.

Released as a public beta offering in June 2013, Cloudera Search was the first enterprise-ready search-on-Hadoop solution on the market. Since that time, the company has worked closely with enterprise customers, open source users and technology partners, rigorously testing and refining the platform in real world applications to deliver today's production-hardened and customer validated 1.0 release, designed from the ground-up for mission critical workloads.

Cloudera Search is specifically designed to support business users in quickly and efficiently locating relevant data stored in Hadoop for further processing and analysis and is fully integrated with the CDH platform. Key features include:

  • Scalable, Reliable Index Storage in HDFS: integrates index storage and serving directly into HDFS.
  • Batch Indexing via MapReduce: allows for scalable and robust index creation of data stored in HDFS and HBase that is comparable to MapReduce.
  • Real-time Indexing at Collection: makes events searchable as it they are stored in HDFS and HBase through near real-time indexing features, powered by Apache Flume™ and the Lily HBase Indexer.
  • Easy Interaction and Data Exploration via Cloudera Hue: offers plug-in application and easy-to-install capabilities for standard Hue servers to query data and view result files, enabling faceted exploration.
  • Simplified Field Extraction and Cross-Platform Data Processing: enables quick and easy field extraction of any data that is stored in HDFS using optimized Hadoop file formats, such as Apache Avro ™. Users can avoid the pain that many standalone search solutions impose, by promoting reusable configurations and processing activities with the new processing framework, Cloudera Morphlines
  • Unified Management and Monitoring with Cloudera Manager: provides a centralized management and monitoring experience that makes it as easy to deploy, configure, and monitor search services as it is to manage CDH deployments and other services on the Hadoop cluster.

Maximize the Value of Cloudera Search with an RTS Subscription
The RTS (Real-time Search) subscription is the best way to leverage the power of Cloudera Search, offering technical support, legal indemnification and continual influence over the development of the open source project. With an RTS subscription, customers can get up and running more quickly, resolve issues more effectively and ensure that the technology remains in alignment with the strategic objectives of their big data deployment.

"As enterprise Hadoop deployments continue to mature, becoming primary repositories for more and more types of data, the center of gravity for data management continues to make a meaningful shift toward Hadoop," said Charles Zedlewski, vice president, Products, Cloudera. "We've taken what was once a relatively complicated and involved freestanding system, requiring its own hardware and operational model, and turned it into a feature of a larger, more ubiquitous open source platform -- CDH. We believe this integrated approach represents a big step forward for users of both Solr and Hadoop. With Cloudera Search, Hadoop deployments can now be explored with the same ease of use and speed as a simple Google search engine query, empowering our customers to achieve rapid insights from a fully integrated platform."

"For too long, the power of data has been available only to technical users in the enterprise. To fully unlock the potential of Hadoop, data needs to be available and consumable by workers beyond IT and across the organization," said Justin Langseth, chief executive officer and founder, Zoomdata. "Through our integration with Impala and Cloudera Search, Zoomdata customers can turn big datasets and streams into compelling, interactive visualizations that makes information accessible to anyone. Our partnership with Cloudera gives our mutual customers the ability to see, analyze and explore their data in real time and put it to work, regardless of their technical skill level. Everyday business users now have the power to find information and perform analytics on billions of rows of raw data from almost on any device without the need for additional training."

Learn More: Zoomdata Leverages Cloudera Enterprise RTS to Democratize Hadoop Search for Its Customers
Watch a video demo from Cloudera partner Zoomdata to see how the company has integrated its solution with Cloudera Search to simplify the creation of data visualizations: http://youtu.be/yALNUmicadg

Product Availability
Cloudera Search 1.0 is immediately available to open source users and can be downloaded for free at www.cloudera.com/downloads. Cloudera RTS is immediately available to Cloudera Enterprise subscribers, as a supplemental module. For more information, visit www.cloudera.com/search.

About Cloudera
Founded in 2008, Cloudera pioneered the business case for Hadoop with CDH: the world's most comprehensive, thoroughly tested and widely deployed 100% open source distribution of Apache Hadoop in both commercial and non-commercial environments. Now, the company is redefining data management with its Platform for Big Data, Cloudera Enterprise, empowering enterprises to Ask Bigger Questions™ and gain rich, actionable insights from all their data, to quickly and easily derive real business value that translates into competitive advantage. As the top contributor to the Apache open source community and leading educator of data professionals with the broadest array of Hadoop training and certification programs, Cloudera also offers comprehensive consulting services. Over 700 partners across hardware, software and services have teamed with Cloudera to help meet organizations' big data goals. With tens of thousands of nodes under management and hundreds of customers across diverse markets, Cloudera is the category leader that has set the standard for Hadoop in the enterprise. www.cloudera.com

Connect with Cloudera
Read the blog: http://www.cloudera.com/blog/
Follow on Twitter: http://twitter.com/cloudera
Visit on Facebook: http://www.facebook.com/cloudera

Add to Digg Bookmark with del.icio.us Add to Newsvine

More Stories By Marketwired .

Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any forms of copying other than an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.

@ThingsExpo Stories
We certainly live in interesting technological times. And no more interesting than the current competing IoT standards for connectivity. Various standards bodies, approaches, and ecosystems are vying for mindshare and positioning for a competitive edge. It is clear that when the dust settles, we will have new protocols, evolved protocols, that will change the way we interact with devices and infrastructure. We will also have evolved web protocols, like HTTP/2, that will be changing the very core of our infrastructures. At the same time, we have old approaches made new again like micro-services...
Every innovation or invention was originally a daydream. You like to imagine a “what-if” scenario. And with all the attention being paid to the so-called Internet of Things (IoT) you don’t have to stretch the imagination too much to see how this may impact commercial and homeowners insurance. We’re beyond the point of accepting this as a leap of faith. The groundwork is laid. Now it’s just a matter of time. We can thank the inventors of smart thermostats for developing a practical business application that everyone can relate to. Gone are the salad days of smart home apps, the early chalkb...
Operational Hadoop and the Lambda Architecture for Streaming Data Apache Hadoop is emerging as a distributed platform for handling large and fast incoming streams of data. Predictive maintenance, supply chain optimization, and Internet-of-Things analysis are examples where Hadoop provides the scalable storage, processing, and analytics platform to gain meaningful insights from granular data that is typically only valuable from a large-scale, aggregate view. One architecture useful for capturing and analyzing streaming data is the Lambda Architecture, representing a model of how to analyze rea...
Today’s enterprise is being driven by disruptive competitive and human capital requirements to provide enterprise application access through not only desktops, but also mobile devices. To retrofit existing programs across all these devices using traditional programming methods is very costly and time consuming – often prohibitively so. In his session at @ThingsExpo, Jesse Shiah, CEO, President, and Co-Founder of AgilePoint Inc., discussed how you can create applications that run on all mobile devices as well as laptops and desktops using a visual drag-and-drop application – and eForms-buildi...
SYS-CON Events announced today that Vitria Technology, Inc. will exhibit at SYS-CON’s @ThingsExpo, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY. Vitria will showcase the company’s new IoT Analytics Platform through live demonstrations at booth #330. Vitria’s IoT Analytics Platform, fully integrated and powered by an operational intelligence engine, enables customers to rapidly build and operationalize advanced analytics to deliver timely business outcomes for use cases across the industrial, enterprise, and consumer segments.
SYS-CON Events announced today that Dyn, the worldwide leader in Internet Performance, will exhibit at SYS-CON's 16th International Cloud Expo®, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY. Dyn is a cloud-based Internet Performance company. Dyn helps companies monitor, control, and optimize online infrastructure for an exceptional end-user experience. Through a world-class network and unrivaled, objective intelligence into Internet conditions, Dyn ensures traffic gets delivered faster, safer, and more reliably than ever.
Containers and microservices have become topics of intense interest throughout the cloud developer and enterprise IT communities. Accordingly, attendees at the upcoming 16th Cloud Expo at the Javits Center in New York June 9-11 will find fresh new content in a new track called PaaS | Containers & Microservices Containers are not being considered for the first time by the cloud community, but a current era of re-consideration has pushed them to the top of the cloud agenda. With the launch of Docker's initial release in March of 2013, interest was revved up several notches. Then late last...
In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect at GE, and Ibrahim Gokcen, who leads GE's advanced IoT analytics, focused on the Internet of Things / Industrial Internet and how to make it operational for business end-users. Learn about the challenges posed by machine and sensor data and how to marry it with enterprise data. They also discussed the tips and tricks to provide the Industrial Internet as an end-user consumable service using Big Data Analytics and Industrial Cloud.
Performance is the intersection of power, agility, control, and choice. If you value performance, and more specifically consistent performance, you need to look beyond simple virtualized compute. Many factors need to be considered to create a truly performant environment. In his General Session at 15th Cloud Expo, Harold Hannon, Sr. Software Architect at SoftLayer, discussed how to take advantage of a multitude of compute options and platform features to make cloud the cornerstone of your online presence.
The explosion of connected devices / sensors is creating an ever-expanding set of new and valuable data. In parallel the emerging capability of Big Data technologies to store, access, analyze, and react to this data is producing changes in business models under the umbrella of the Internet of Things (IoT). In particular within the Insurance industry, IoT appears positioned to enable deep changes by altering relationships between insurers, distributors, and the insured. In his session at @ThingsExpo, Michael Sick, a Senior Manager and Big Data Architect within Ernst and Young's Financial Servi...
Even as cloud and managed services grow increasingly central to business strategy and performance, challenges remain. The biggest sticking point for companies seeking to capitalize on the cloud is data security. Keeping data safe is an issue in any computing environment, and it has been a focus since the earliest days of the cloud revolution. Understandably so: a lot can go wrong when you allow valuable information to live outside the firewall. Recent revelations about government snooping, along with a steady stream of well-publicized data breaches, only add to the uncertainty
The explosion of connected devices / sensors is creating an ever-expanding set of new and valuable data. In parallel the emerging capability of Big Data technologies to store, access, analyze, and react to this data is producing changes in business models under the umbrella of the Internet of Things (IoT). In particular within the Insurance industry, IoT appears positioned to enable deep changes by altering relationships between insurers, distributors, and the insured. In his session at @ThingsExpo, Michael Sick, a Senior Manager and Big Data Architect within Ernst and Young's Financial Servi...
Docker is an excellent platform for organizations interested in running microservices. It offers portability and consistency between development and production environments, quick provisioning times, and a simple way to isolate services. In his session at DevOps Summit at 16th Cloud Expo, Shannon Williams, co-founder of Rancher Labs, will walk through these and other benefits of using Docker to run microservices, and provide an overview of RancherOS, a minimalist distribution of Linux designed expressly to run Docker. He will also discuss Rancher, an orchestration and service discovery platf...
PubNub on Monday has announced that it is partnering with IBM to bring its sophisticated real-time data streaming and messaging capabilities to Bluemix, IBM’s cloud development platform. “Today’s app and connected devices require an always-on connection, but building a secure, scalable solution from the ground up is time consuming, resource intensive, and error-prone,” said Todd Greene, CEO of PubNub. “PubNub enables web, mobile and IoT developers building apps on IBM Bluemix to quickly add scalable realtime functionality with minimal effort and cost.”
Sensor-enabled things are becoming more commonplace, precursors to a larger and more complex framework that most consider the ultimate promise of the IoT: things connecting, interacting, sharing, storing, and over time perhaps learning and predicting based on habits, behaviors, location, preferences, purchases and more. In his session at @ThingsExpo, Tom Wesselman, Director of Communications Ecosystem Architecture at Plantronics, will examine the still nascent IoT as it is coalescing, including what it is today, what it might ultimately be, the role of wearable tech, and technology gaps stil...
CommVault has announced that top industry technology visionaries have joined its leadership team. The addition of leaders from companies such as Oracle, SAP, Microsoft, Cisco, PwC and EMC signals the continuation of CommVault Next, the company's business transformation for sales, go-to-market strategies, pricing and packaging and technology innovation. The company also announced that it had realigned its structure to create business units to more directly match how customers evaluate, deploy, operate, and purchase technology.
In the consumer IoT, everything is new, and the IT world of bits and bytes holds sway. But industrial and commercial realms encompass operational technology (OT) that has been around for 25 or 50 years. This grittier, pre-IP, more hands-on world has much to gain from Industrial IoT (IIoT) applications and principles. But adding sensors and wireless connectivity won’t work in environments that demand unwavering reliability and performance. In his session at @ThingsExpo, Ron Sege, CEO of Echelon, will discuss how as enterprise IT embraces other IoT-related technology trends, enterprises with i...
When it comes to the Internet of Things, hooking up will get you only so far. If you want customers to commit, you need to go beyond simply connecting products. You need to use the devices themselves to transform how you engage with every customer and how you manage the entire product lifecycle. In his session at @ThingsExpo, Sean Lorenz, Technical Product Manager for Xively at LogMeIn, will show how “product relationship management” can help you leverage your connected devices and the data they generate about customer usage and product performance to deliver extremely compelling and reliabl...
The Internet of Things (IoT) is causing data centers to become radically decentralized and atomized within a new paradigm known as “fog computing.” To support IoT applications, such as connected cars and smart grids, data centers' core functions will be decentralized out to the network's edges and endpoints (aka “fogs”). As this trend takes hold, Big Data analytics platforms will focus on high-volume log analysis (aka “logs”) and rely heavily on cognitive-computing algorithms (aka “cogs”) to make sense of it all.
With several hundred implementations of IoT-enabled solutions in the past 12 months alone, this session will focus on experience over the art of the possible. Many can only imagine the most advanced telematics platform ever deployed, supporting millions of customers, producing tens of thousands events or GBs per trip, and hundreds of TBs per month. With the ability to support a billion sensor events per second, over 30PB of warm data for analytics, and hundreds of PBs for an data analytics archive, in his session at @ThingsExpo, Jim Kaskade, Vice President and General Manager, Big Data & Ana...