Welcome!

Agile Computing Authors: Elizabeth White, Yeshim Deniz, Pat Romanski, Michel Courtoy, ManageEngine IT Matters

Related Topics: Agile Computing, Containers Expo Blog

Agile Computing: Blog Feed Post

Give Your Unstructured Data the Meyers-Briggs

One of the Problems Currently Facing the Enterprise is to Properly Categorize the Data

For those who don’t know, according to the Meyers & Briggs Foundation, part of the Meyers-Briggs Assessment is defined as: The essence of the theory is that much seemingly random variation in the behavior is actually quite orderly and consistent… The same can be said about your data. Much that is seemingly random is consistent and predictable. One of the problems currently facing the enterprise is to properly categorize that data so that its “personality” is well known. You cannot sort (or tier) what you don’t know, and this is a simple proposal for how you might begin such a categorization.

No matter what your organization does, it has a variety of data in a variety of types with a variety of attributes that can be built into indices to help you understand not only what you have, but how much of it you have, what its relative importance is to the organization, and how you can make use of all of this information to help you move data about in an intelligent manner.

Meyers-Briggs uses initials to give your average Joe (or Jane) an easy-to-access summary of a given individual’s personality type, and by extension how to interface with that person. This little tool aims to do the same type of thing for your data, so I kept their initials and mapped them to information about your data that will help you figure out what to do with it. Perhaps a bit gimmicky, but it’s valid, so let’s get started defining your data

We can break data information into two categories – physical and extended. Physical information can be readily accessed and utilized by automated tiering systems like the one built into our own ARX, but extended information is unique to your organization and some of it will fluctuate over time. That is the hard part that interviews and intelligent data analysis will be required to determine. Not insurmountable, but certainly a task, and if you’re a geek that doesn’t like to play with “squishy” data, not an envious task at all. Though knowing this stuff will help you come to logical conclusions about where and how the data will be stored.

Physical Extended
Extension Interest
Timestamp Jurisdiction
Size Permissions
Filesystem Necessity

 

First, clear definitions of each data type.

  • Extension is the file extension. It will not only tell you type of file (generally), it will also tell you aggregate type. An AVI falls into the video category, for example. Yeah, it can be audio too, but most organizations treat the two media types similarly when making decisions strictly on extension.
  • Timestamp the last time that the filesystem shows this file as written. If you have a tool (like ARX) that allows you to accurately track last access date/time also, then you could use that information much more intelligently than last save time.
  • Size Let’s face it, the multi-gigabyte file is going to be treated differently than the 10K file just because it is a big win to get it off of tier one storage and on to something cheaper.
  • Filesystem Files on the SAN generally take more money to keep there than those on the NAS. Now if your SAN is low-end and your NAS high-end, it is possible that this is untrue for you, but either way, knowing what filesystem a file is hosted on helps you to understand what the impact of moving that file will be.
  • Interest How much interest would this file be to ne’er-do-wells that got access to the storage medium it is currently stored on?
  • Jurisdication Who is the ultimate owner of this data? The person who can make decisions about its use, distribution, and access rights?
  • Permissions Who has access to this file, and is it by user or group, is access to this file managed on the file itself, or the file system it is stored on?
  • Necessity How is this file used within the organization? If it went away tomorrow, who would be impacted and how would they be impacted?

The idea is to collect all of this information about your files so that you can make intelligent decisions about how to move that data around and store it in the most appropriate place. As I said above, there are tools to help you with the physical stuff, and some of them help with Permissions also. But you’ll still need to collect the other data, and that’s a lot of work. If you just plain don’t have time to interview director-level people about their team’s data usage and specific files, then start with directories. Something is better than nothing after all, and behaviorally most groups put like data into folders as far as usage, permissions, jurisdiction, and necessity. After all, the fantasy football spreadsheet isn’t generally stored in the new product development folder.

Using these values, you can properly categorize your data, which is the first step to both understanding it and organizing it – and tiering it.

Unlike Meyers-Briggs, these attributes can have multiple non-numeric values, so your tracking will be a little bit more complex than a Meyers-Briggs score, but it will be highly valuable in helping you figure out what to do with your data. Data whose necessity is high will obviously take pride of place on your tier one storage systems – unless it is almost never accessed, which the better version of timestamp could tell you.

If you just don’t have the manhours, cooperation, or desire to work through all of this, then invest in an automated tiering product, let it learn, and turn it on. It will get you 50% there, maybe 5/8ths of the way there, with no significant effort on your part. You’ll have to install and configure it, and monitor it… But the investment is small compared to interviewing business owners and asking them to make definitive statements about all of the data they own. And it gets you started.

In the end, you can’t send stuff that is of high interest unprotected into the cloud, you can’t run stuff that is frequently accessed into an archival format, and you want to check how many movies and audio files you have, where they’re stored, and how much space they take up. So the more you know, the more power over your storage environment you will have.

Meyers-Briggs is a trademark of the Meyers and Briggs Foundation.


Follow me on Twitter icon_facebook

AddThis Feed Button Bookmark and Share

Related Articles and Blogs:

Read the original blog entry...

More Stories By Don MacVittie

Don MacVittie is founder of Ingrained Technology, A technical advocacy and software development consultancy. He has experience in application development, architecture, infrastructure, technical writing,DevOps, and IT management. MacVittie holds a B.S. in Computer Science from Northern Michigan University, and an M.S. in Computer Science from Nova Southeastern University.

@ThingsExpo Stories
We build IoT infrastructure products - when you have to integrate different devices, different systems and cloud you have to build an application to do that but we eliminate the need to build an application. Our products can integrate any device, any system, any cloud regardless of protocol," explained Peter Jung, Chief Product Officer at Pulzze Systems, in this SYS-CON.tv interview at @ThingsExpo, held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA
With major technology companies and startups seriously embracing Cloud strategies, now is the perfect time to attend 21st Cloud Expo October 31 - November 2, 2017, at the Santa Clara Convention Center, CA, and June 12-14, 2018, at the Javits Center in New York City, NY, and learn what is going on, contribute to the discussions, and ensure that your enterprise is on the right path to Digital Transformation.
In his session at @ThingsExpo, Eric Lachapelle, CEO of the Professional Evaluation and Certification Board (PECB), provided an overview of various initiatives to certify the security of connected devices and future trends in ensuring public trust of IoT. Eric Lachapelle is the Chief Executive Officer of the Professional Evaluation and Certification Board (PECB), an international certification body. His role is to help companies and individuals to achieve professional, accredited and worldwide re...
The Internet giants are fully embracing AI. All the services they offer to their customers are aimed at drawing a map of the world with the data they get. The AIs from these companies are used to build disruptive approaches that cannot be used by established enterprises, which are threatened by these disruptions. However, most leaders underestimate the effect this will have on their businesses. In his session at 21st Cloud Expo, Rene Buest, Director Market Research & Technology Evangelism at Ara...
SYS-CON Events announced today that Enzu will exhibit at SYS-CON's 21st Int\ernational Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Enzu’s mission is to be the leading provider of enterprise cloud solutions worldwide. Enzu enables online businesses to use its IT infrastructure to their competitive advantage. By offering a suite of proven hosting and management services, Enzu wants companies to focus on the core of their ...
Internet of @ThingsExpo, taking place October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 21st Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago. All major researchers estimate there will be tens of billions devic...
Amazon started as an online bookseller 20 years ago. Since then, it has evolved into a technology juggernaut that has disrupted multiple markets and industries and touches many aspects of our lives. It is a relentless technology and business model innovator driving disruption throughout numerous ecosystems. Amazon’s AWS revenues alone are approaching $16B a year making it one of the largest IT companies in the world. With dominant offerings in Cloud, IoT, eCommerce, Big Data, AI, Digital Assista...
SYS-CON Events announced today that Cloud Academy named "Bronze Sponsor" of 21st International Cloud Expo which will take place October 31 - November 2, 2017 at the Santa Clara Convention Center in Santa Clara, CA. Cloud Academy is the industry’s most innovative, vendor-neutral cloud technology training platform. Cloud Academy provides continuous learning solutions for individuals and enterprise teams for Amazon Web Services, Microsoft Azure, Google Cloud Platform, and the most popular cloud com...
When growing capacity and power in the data center, the architectural trade-offs between server scale-up vs. scale-out continue to be debated. Both approaches are valid: scale-out adds multiple, smaller servers running in a distributed computing model, while scale-up adds fewer, more powerful servers that are capable of running larger workloads. It’s worth noting that there are additional, unique advantages that scale-up architectures offer. One big advantage is large memory and compute capacity...
No hype cycles or predictions of zillions of things here. IoT is big. You get it. You know your business and have great ideas for a business transformation strategy. What comes next? Time to make it happen. In his session at @ThingsExpo, Jay Mason, Associate Partner at M&S Consulting, presented a step-by-step plan to develop your technology implementation strategy. He discussed the evaluation of communication standards and IoT messaging protocols, data analytics considerations, edge-to-cloud tec...
"When we talk about cloud without compromise what we're talking about is that when people think about 'I need the flexibility of the cloud' - it's the ability to create applications and run them in a cloud environment that's far more flexible,” explained Matthew Finnie, CTO of Interoute, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
SYS-CON Events announced today that MobiDev, a client-oriented software development company, will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. MobiDev is a software company that develops and delivers turn-key mobile apps, websites, web services, and complex software systems for startups and enterprises. Since 2009 it has grown from a small group of passionate engineers and business...
SYS-CON Events announced today that GrapeUp, the leading provider of rapid product development at the speed of business, will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Grape Up is a software company, specialized in cloud native application development and professional services related to Cloud Foundry PaaS. With five expert teams that operate in various sectors of the market acr...
SYS-CON Events announced today that Ayehu will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on October 31 - November 2, 2017 at the Santa Clara Convention Center in Santa Clara California. Ayehu provides IT Process Automation & Orchestration solutions for IT and Security professionals to identify and resolve critical incidents and enable rapid containment, eradication, and recovery from cyber security breaches. Ayehu provides customers greater control over IT infras...
With the introduction of IoT and Smart Living in every aspect of our lives, one question has become relevant: What are the security implications? To answer this, first we have to look and explore the security models of the technologies that IoT is founded upon. In his session at @ThingsExpo, Nevi Kaja, a Research Engineer at Ford Motor Company, discussed some of the security challenges of the IoT infrastructure and related how these aspects impact Smart Living. The material was delivered interac...
Artificial intelligence, machine learning, neural networks. We’re in the midst of a wave of excitement around AI such as hasn’t been seen for a few decades. But those previous periods of inflated expectations led to troughs of disappointment. Will this time be different? Most likely. Applications of AI such as predictive analytics are already decreasing costs and improving reliability of industrial machinery. Furthermore, the funding and research going into AI now comes from a wide range of com...
In his session at Cloud Expo, Alan Winters, an entertainment executive/TV producer turned serial entrepreneur, presented a success story of an entrepreneur who has both suffered through and benefited from offshore development across multiple businesses: The smart choice, or how to select the right offshore development partner Warning signs, or how to minimize chances of making the wrong choice Collaboration, or how to establish the most effective work processes Budget control, or how to ma...
IoT solutions exploit operational data generated by Internet-connected smart “things” for the purpose of gaining operational insight and producing “better outcomes” (for example, create new business models, eliminate unscheduled maintenance, etc.). The explosive proliferation of IoT solutions will result in an exponential growth in the volume of IoT data, precipitating significant Information Governance issues: who owns the IoT data, what are the rights/duties of IoT solutions adopters towards t...
SYS-CON Events announced today that CA Technologies has been named "Platinum Sponsor" of SYS-CON's 21st International Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. CA Technologies helps customers succeed in a future where every business - from apparel to energy - is being rewritten by software. From planning to development to management to security, CA creates software that fuels transformation for companies in the applic...
SYS-CON Events announced today that IBM has been named “Diamond Sponsor” of SYS-CON's 21st Cloud Expo, which will take place on October 31 through November 2nd 2017 at the Santa Clara Convention Center in Santa Clara, California.