Click here to close now.

Welcome!

Web 2.0 Authors: Marty Puranik, Pat Romanski, Elizabeth White, Liz McMillan, Aria Blog

Blog Feed Post

Big Data as Core, Big Data as Context, and Big Data as Buzzword Bingo

3711242567_7a2f9e6f13_zIt’s neither particularly newsworthy nor insightful to suggest that ‘Big Data’ gets everywhere these days, but two recent items reminded me of the gulf between credible execution of a big data play and the more questionable tacking of the big data meme onto an otherwise useful product.

Christmas is coming. Which means skating, and pantomimes (Captain Jack! And the Krankies!), and surprisingly expensive daughter shops, and pie with chicken and banana. But in amongst that lot, the weekend’s email and RSS brought news of

an ideal solution to store, manage and archive big data

and a

service built specifically for Fortune 1000 enterprises who want to rapidly explore how big data technology can unlock revenue from their data.

(both with my emphasis)

Infochimps has been around since 2009, and I’ve been following them with interest. CTO and Co-Founder Flip Kromer and I recorded podcasts in 2009 and early 2012, and we continue to meet up from time to time. From humble beginnings, the company grew to become one of a handful of credible Data Market offerings, before moving on to contribute key pieces of code to projects such as VMware’s Serengeti. Earlier this year, Infochimps’ broader ambitions began to become public as the Infochimps Platform rolled out. In August, the Platform gained streaming capabilities that helped propel it beyond any early reliance upon Hadoop. Then, this month, things got really interesting with the arrival of the Infochimps Enterprise Cloud. As Alex Williams reported for TechCrunch on Monday,

Infochimps data scientists and engineers developed the platform so they could collect lots of data and perform complex analytics along the way. A customer can pull in data from CRM systems and any of the other app silos where data pools then combine it with the data from Facebook, Twitter, and other services. The data flows into Infochimps’ data-delivery service and is cleaned up along the way. Data gets enriched, as needed, with other pieces of information such as demographic data.

The service works with any kind of database. Infochimps can implement any combination, including relational for SQL-like queries, and NoSQL for Hadoop jobs and big data storage. Analysis tools on the back-end provide the capability to create visuals and reports.

The company is setting itself some bold targets, seeking to speed up system deployments, making it easier for existing staff to do new things with data they already own, and freeing users to deploy a wide range of big data tools beyond the default of the cuddly elephant. And they’re targeting this directly at the Fortune 1000; companies with huge IT operations, demanding requirements, and an expectation of support, service and quality, all day, every day. For a small company of around 30 employees, which raised $1.55 million back in 2010 and hasn’t reported an investment since, that’s a big ask.

If even a fraction of what the Enterprise Cloud promises is available today, or demonstrably around the corner, then that team of 30 must be spending most of their time fending off a swarm of investors and acquirers. A nice problem to have, but a problem all the same.

I look forward to seeing real examples of the uses to which enterprise customers begin putting the Enterprise Cloud. I’ll also be watching with interest for rumours of acquisition or investment, both of which are bound to come.

The other piece of news also came from an established company. This time, consumer and small business backup provider Genie9. The company has a new backup product out, called Zoolz, and is making much of the integral “Cold Storage™ Technology” (Ugh!) that gives users reasonably straightforward access to Amazon’s very cheap Glacier storage service.

Personally, I achieve my backup and archival needs through a combination of DropBox, Google Drive, Spanning Backup, a Time Capsule and Arq (complete with its own non-™ hooks into Glacier). But that’s me. A one man band, with a particular set of devices and workflows, and it’s an arrangement that has grown up rather organically.

Zoolz makes perfect sense as a backup solution, and from a brief play with the tool it appears intuitive, capable, and affordable. The Glacier integration is also good, for those things you want to keep, but which you don’t need to access regularly. I have no problem with the tool at all, but what did (and does) bemuse me was the emphasis upon its role in meeting big data requirements.

Zoolz is designed with big data support in mind and will be a game changer to help companies move all their data to the cloud in a secure and fast way that is cheaper than tapes and traditional solutions.

Huh?

The web site devotes a whole page to the big data capabilities of Zoolz, but I’m singularly unconvinced. The whole point about big data, surely, is that you work with it? You pour it into very capable tools that allow you to hold it in (or close to) memory, and you chop and change it in a variety of ways whilst seeking insight? You don’t park it 3-5 hours away in an Amazon cold storage facility and think “job done,” just because Zoolz offers “photo preview” !

Zoolz (through Glacier) offers a place to park large volumes of data that you no longer wish to work with, but it does nothing at all to help people ingest, process, analyse or understand big data. Moving large volumes of data around is slow and expensive. Processes to work with data are often scripted or otherwise automated, and tied into workflows that make sense within the context of the analytic tools (like Hadoop, say) to be used. It’s wholly unclear that Zoolz’s pretty UI and consumer/small business workflows make any sense in that context whatsoever.

Personally, Genie9, I would be proud of what I’ve made in Zoolz. But I’d drop the ‘big data’ stuff. It doesn’t fit.

Bingo card image by Flickr user Sara

Read the original blog entry...

More Stories By Paul Miller

Paul Miller works at the interface between the worlds of Cloud Computing and the Semantic Web, providing the insights that enable you to exploit the next wave as we approach the World Wide Database. He blogs at www.cloudofdata.com.

@ThingsExpo Stories
One of the biggest impacts of the Internet of Things is and will continue to be on data; specifically data volume, management and usage. Companies are scrambling to adapt to this new and unpredictable data reality with legacy infrastructure that cannot handle the speed and volume of data. In his session at @ThingsExpo, Don DeLoach, CEO and president of Infobright, will discuss how companies need to rethink their data infrastructure to participate in the IoT, including: Data storage: Understanding the kinds of data: structured, unstructured, big/small? Analytics: What kinds and how responsiv...
Since 2008 and for the first time in history, more than half of humans live in urban areas, urging cities to become “smart.” Today, cities can leverage the wide availability of smartphones combined with new technologies such as Beacons or NFC to connect their urban furniture and environment to create citizen-first services that improve transportation, way-finding and information delivery. In her session at @ThingsExpo, Laetitia Gazel-Anthoine, CEO of Connecthings, will focus on successful use cases.
The Workspace-as-a-Service (WaaS) market will grow to $6.4B by 2018. In his session at 16th Cloud Expo, Seth Bostock, CEO of IndependenceIT, will begin by walking the audience through the evolution of Workspace as-a-Service, where it is now vs. where it going. To look beyond the desktop we must understand exactly what WaaS is, who the users are, and where it is going in the future. IT departments, ISVs and service providers must look to workflow and automation capabilities to adapt to growing demand and the rapidly changing workspace model.
Sensor-enabled things are becoming more commonplace, precursors to a larger and more complex framework that most consider the ultimate promise of the IoT: things connecting, interacting, sharing, storing, and over time perhaps learning and predicting based on habits, behaviors, location, preferences, purchases and more. In his session at @ThingsExpo, Tom Wesselman, Director of Communications Ecosystem Architecture at Plantronics, will examine the still nascent IoT as it is coalescing, including what it is today, what it might ultimately be, the role of wearable tech, and technology gaps stil...
Almost everyone sees the potential of Internet of Things but how can businesses truly unlock that potential. The key will be in the ability to discover business insight in the midst of an ocean of Big Data generated from billions of embedded devices via Systems of Discover. Businesses will also need to ensure that they can sustain that insight by leveraging the cloud for global reach, scale and elasticity.
The Internet of Things (IoT) promises to evolve the way the world does business; however, understanding how to apply it to your company can be a mystery. Most people struggle with understanding the potential business uses or tend to get caught up in the technology, resulting in solutions that fail to meet even minimum business goals. In his session at @ThingsExpo, Jesse Shiah, CEO / President / Co-Founder of AgilePoint Inc., showed what is needed to leverage the IoT to transform your business. He discussed opportunities and challenges ahead for the IoT from a market and technical point of vie...
IoT is still a vague buzzword for many people. In his session at @ThingsExpo, Mike Kavis, Vice President & Principal Cloud Architect at Cloud Technology Partners, discussed the business value of IoT that goes far beyond the general public's perception that IoT is all about wearables and home consumer services. He also discussed how IoT is perceived by investors and how venture capitalist access this space. Other topics discussed were barriers to success, what is new, what is old, and what the future may hold. Mike Kavis is Vice President & Principal Cloud Architect at Cloud Technology Pa...
Hadoop as a Service (as offered by handful of niche vendors now) is a cloud computing solution that makes medium and large-scale data processing accessible, easy, fast and inexpensive. In his session at Big Data Expo, Kumar Ramamurthy, Vice President and Chief Technologist, EIM & Big Data, at Virtusa, will discuss how this is achieved by eliminating the operational challenges of running Hadoop, so one can focus on business growth. The fragmented Hadoop distribution world and various PaaS solutions that provide a Hadoop flavor either make choices for customers very flexible in the name of opti...
The true value of the Internet of Things (IoT) lies not just in the data, but through the services that protect the data, perform the analysis and present findings in a usable way. With many IoT elements rooted in traditional IT components, Big Data and IoT isn’t just a play for enterprise. In fact, the IoT presents SMBs with the prospect of launching entirely new activities and exploring innovative areas. CompTIA research identifies several areas where IoT is expected to have the greatest impact.
Advanced Persistent Threats (APTs) are increasing at an unprecedented rate. The threat landscape of today is drastically different than just a few years ago. Attacks are much more organized and sophisticated. They are harder to detect and even harder to anticipate. In the foreseeable future it's going to get a whole lot harder. Everything you know today will change. Keeping up with this changing landscape is already a daunting task. Your organization needs to use the latest tools, methods and expertise to guard against those threats. But will that be enough? In the foreseeable future attacks w...
The Internet of Things (IoT) is rapidly in the process of breaking from its heretofore relatively obscure enterprise applications (such as plant floor control and supply chain management) and going mainstream into the consumer space. More and more creative folks are interconnecting everyday products such as household items, mobile devices, appliances and cars, and unleashing new and imaginative scenarios. We are seeing a lot of excitement around applications in home automation, personal fitness, and in-car entertainment and this excitement will bleed into other areas. On the commercial side, m...
Disruptive macro trends in technology are impacting and dramatically changing the "art of the possible" relative to supply chain management practices through the innovative use of IoT, cloud, machine learning and Big Data to enable connected ecosystems of engagement. Enterprise informatics can now move beyond point solutions that merely monitor the past and implement integrated enterprise fabrics that enable end-to-end supply chain visibility to improve customer service delivery and optimize supplier management. Learn about enterprise architecture strategies for designing connected systems tha...
Dale Kim is the Director of Industry Solutions at MapR. His background includes a variety of technical and management roles at information technology companies. While his experience includes work with relational databases, much of his career pertains to non-relational data in the areas of search, content management, and NoSQL, and includes senior roles in technical marketing, sales engineering, and support engineering. Dale holds an MBA from Santa Clara University, and a BA in Computer Science from the University of California, Berkeley.
Wearable devices have come of age. The primary applications of wearables so far have been "the Quantified Self" or the tracking of one's fitness and health status. We propose the evolution of wearables into social and emotional communication devices. Our BE(tm) sensor uses light to visualize the skin conductance response. Our sensors are very inexpensive and can be massively distributed to audiences or groups of any size, in order to gauge reactions to performances, video, or any kind of presentation. In her session at @ThingsExpo, Jocelyn Scheirer, CEO & Founder of Bionolux, will discuss ho...
The cloud is now a fact of life but generating recurring revenues that are driven by solutions and services on a consumption model have been hard to implement, until now. In their session at 16th Cloud Expo, Ermanno Bonifazi, CEO & Founder of Solgenia, and Ian Khan, Global Strategic Positioning & Brand Manager at Solgenia, will discuss how a top European telco has leveraged the innovative recurring revenue generating capability of the consumption cloud to enable a unique cloud monetization model to drive results.
Docker is an excellent platform for organizations interested in running microservices. It offers portability and consistency between development and production environments, quick provisioning times, and a simple way to isolate services. In his session at DevOps Summit at 16th Cloud Expo, Shannon Williams, co-founder of Rancher Labs, will walk through these and other benefits of using Docker to run microservices, and provide an overview of RancherOS, a minimalist distribution of Linux designed expressly to run Docker. He will also discuss Rancher, an orchestration and service discovery platf...
As organizations shift toward IT-as-a-service models, the need for managing and protecting data residing across physical, virtual, and now cloud environments grows with it. CommVault can ensure protection &E-Discovery of your data – whether in a private cloud, a Service Provider delivered public cloud, or a hybrid cloud environment – across the heterogeneous enterprise. In his session at 16th Cloud Expo, Randy De Meno, Chief Technologist - Windows Products and Microsoft Partnerships, will discuss how to cut costs, scale easily, and unleash insight with CommVault Simpana software, the only si...
Analytics is the foundation of smart data and now, with the ability to run Hadoop directly on smart storage systems like Cloudian HyperStore, enterprises will gain huge business advantages in terms of scalability, efficiency and cost savings as they move closer to realizing the potential of the Internet of Things. In his session at 16th Cloud Expo, Paul Turner, technology evangelist and CMO at Cloudian, Inc., will discuss the revolutionary notion that the storage world is transitioning from mere Big Data to smart data. He will argue that today’s hybrid cloud storage solutions, with commodity...
Cloud data governance was previously an avoided function when cloud deployments were relatively small. With the rapid adoption in public cloud – both rogue and sanctioned, it’s not uncommon to find regulated data dumped into public cloud and unprotected. This is why enterprises and cloud providers alike need to embrace a cloud data governance function and map policies, processes and technology controls accordingly. In her session at 15th Cloud Expo, Evelyn de Souza, Data Privacy and Compliance Strategy Leader at Cisco Systems, will focus on how to set up a cloud data governance program and s...
Roberto Medrano, Executive Vice President at SOA Software, had reached 30,000 page views on his home page - http://RobertoMedrano.SYS-CON.com/ - on the SYS-CON family of online magazines, which includes Cloud Computing Journal, Internet of Things Journal, Big Data Journal, and SOA World Magazine. He is a recognized executive in the information technology fields of SOA, internet security, governance, and compliance. He has extensive experience with both start-ups and large companies, having been involved at the beginning of four IT industries: EDA, Open Systems, Computer Security and now SOA.