Big Data as Core, Big Data as Context, and Big Data as Buzzword Bingo

It’s neither particularly newsworthy nor insightful to suggest that ‘Big Data’ gets everywhere these days, but two recent items reminded me of the gulf between credible execution of a big data play and the more questionable tacking of the big data meme onto an otherwise useful product.

Christmas is coming. Which means skating, and pantomimes (Captain Jack! And the Krankies!), and surprisingly expensive daughter shops, and pie with chicken and banana. But in amongst that lot, the weekend’s email and RSS brought news of

an ideal solution to store, manage and archive big data

and a

service built specifically for Fortune 1000 enterprises who want to rapidly explore how big data technology can unlock revenue from their data.

(both with my emphasis)

Infochimps has been around since 2009, and I’ve been following them with interest. CTO and Co-Founder Flip Kromer and I recorded podcasts in 2009 and early 2012, and we continue to meet up from time to time. From humble beginnings, the company grew to become one of a handful of credible Data Market offerings, before moving on to contribute key pieces of code to projects such as VMware’s Serengeti. Earlier this year, Infochimps’ broader ambitions began to become public as the Infochimps Platform rolled out. In August, the Platform gained streaming capabilities that helped propel it beyond any early reliance upon Hadoop. Then, this month, things got really interesting with the arrival of the Infochimps Enterprise Cloud. As Alex Williams reported for TechCrunch on Monday,

Infochimps data scientists and engineers developed the platform so they could collect lots of data and perform complex analytics along the way. A customer can pull in data from CRM systems and any of the other app silos where data pools then combine it with the data from Facebook, Twitter, and other services. The data flows into Infochimps’ data-delivery service and is cleaned up along the way. Data gets enriched, as needed, with other pieces of information such as demographic data.

The service works with any kind of database. Infochimps can implement any combination, including relational for SQL-like queries, and NoSQL for Hadoop jobs and big data storage. Analysis tools on the back-end provide the capability to create visuals and reports.

The company is setting itself some bold targets, seeking to speed up system deployments, making it easier for existing staff to do new things with data they already own, and freeing users to deploy a wide range of big data tools beyond the default of the cuddly elephant. And they’re targeting this directly at the Fortune 1000: companies with huge IT operations, demanding requirements, and an expectation of support, service and quality, all day, every day. For a small company of around 30 employees, which raised $1.55 million back in 2010 and hasn’t reported an investment since, that’s a big ask.

If even a fraction of what the Enterprise Cloud promises is available today, or demonstrably around the corner, then that team of 30 must be spending most of their time fending off a swarm of investors and acquirers. A nice problem to have, but a problem all the same.

I look forward to seeing real examples of the uses to which enterprise customers begin putting the Enterprise Cloud. I’ll also be watching with interest for rumours of acquisition or investment, both of which are bound to come.

The other piece of news also came from an established company. This time, consumer and small business backup provider Genie9. The company has a new backup product out, called Zoolz, and is making much of the integral “Cold Storage™ Technology” (Ugh!) that gives users reasonably straightforward access to Amazon’s very cheap Glacier storage service.

Personally, I achieve my backup and archival needs through a combination of Dropbox, Google Drive, Spanning Backup, a Time Capsule and Arq (complete with its own non-™ hooks into Glacier). But that’s me. A one-man band, with a particular set of devices and workflows, and it’s an arrangement that has grown up rather organically.

Zoolz makes perfect sense as a backup solution, and from a brief play with the tool it appears intuitive, capable, and affordable. The Glacier integration is also good, for those things you want to keep, but which you don’t need to access regularly. I have no problem with the tool at all, but what did (and does) bemuse me was the emphasis upon its role in meeting big data requirements.

Zoolz is designed with big data support in mind and will be a game changer to help companies move all their data to the cloud in a secure and fast way that is cheaper than tapes and traditional solutions.

Huh?

The web site devotes a whole page to the big data capabilities of Zoolz, but I’m singularly unconvinced. The whole point about big data, surely, is that you work with it? You pour it into very capable tools that allow you to hold it in (or close to) memory, and you chop and change it in a variety of ways whilst seeking insight? You don’t park it 3-5 hours away in an Amazon cold storage facility and think “job done,” just because Zoolz offers “photo preview”!

Zoolz (through Glacier) offers a place to park large volumes of data that you no longer wish to work with, but it does nothing at all to help people ingest, process, analyse or understand big data. Moving large volumes of data around is slow and expensive. Processes to work with data are often scripted or otherwise automated, and tied into workflows that make sense within the context of the analytic tools (like Hadoop, say) to be used. It’s wholly unclear that Zoolz’s pretty UI and consumer/small business workflows make any sense in that context whatsoever.
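
To put rough numbers on "slow", here is a back-of-envelope sketch in Python. The 10 TB archive and the 100 Mbit/s link are purely illustrative assumptions, not figures from Genie9 or Amazon, and the function names are hypothetical; only the retrieval delay of a few hours comes from the discussion above.

```python
def transfer_hours(size_tb: float, link_mbps: float) -> float:
    """Hours needed to move size_tb (decimal terabytes) over a link_mbps link."""
    bits = size_tb * 1e12 * 8            # terabytes -> bits
    seconds = bits / (link_mbps * 1e6)   # megabits per second -> bits per second
    return seconds / 3600


def restore_hours(size_tb: float, link_mbps: float, retrieval_delay_hours: float) -> float:
    """Cold-storage retrieval delay plus the time to pull the data back down."""
    return retrieval_delay_hours + transfer_hours(size_tb, link_mbps)


if __name__ == "__main__":
    # Assumed scenario: 10 TB archive, 100 Mbit/s link, 4 hour retrieval delay.
    hours = restore_hours(size_tb=10, link_mbps=100, retrieval_delay_hours=4)
    print(f"Roughly {hours:.0f} hours ({hours / 24:.1f} days) before analysis can start")
```

Under those assumptions the wait runs to days rather than hours, and most of it is the transfer itself rather than the cold storage delay. Either way, that is a long time to be separated from data you supposedly want to work with.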

Personally, Genie9, I would be proud of what I’ve made in Zoolz. But I’d drop the ‘big data’ stuff. It doesn’t fit.

Bingo card image by Flickr user Sara

More Stories By Paul Miller

Paul Miller works at the interface between the worlds of Cloud Computing and the Semantic Web, providing the insights that enable you to exploit the next wave as we approach the World Wide Database.

He blogs at www.cloudofdata.com.
