Doer
Well-Known Member
Just doing a bunch of planning at work. And it is all about Data Mining this year....again. Data collection is a database. We hear GIOG. Garbage In, Garbage Out.
But, when we try to get information out across time and multiple sources, it can still be garbage, but we call it data Mining. Mine data to process information.
So, have you asked yourself, do we know how much data there is? Not specifically, but it is estimated at 2.7 zettabytes just in the digital domain, leaving out all paper. But, it grows at approx. 5 exabytes, per DAY. We are expecting, 35 Zettabytes of data to sell mining tools, into by 2020.
just like the population, it keeps growing. And like the population, there may be a tipping point where we begin to lose information, like we can adjust our birth rates. (well under way, as a matter of fact)
And almost all the information we have today is more like plain dirt. 80% of all information is unstructured. So, almost all the important stuff has to be sifted, like placer panning for flour gold. A data mine is like a hard rock, lode mine. Things are organized and we are not shifting tons of dirt to find one flake of information.
There is more opportunity in this world today than has every existed over all time, in my mind. Plenty of work for me, means plenty of work for you.
Just to give an idea.,,,, A gigabyte is 3 sets of Zeros. 1,000,000,000. A Zettabyte is 6 sets of Zeros.
But, when we try to get information out across time and multiple sources, it can still be garbage, but we call it data Mining. Mine data to process information.
So, have you asked yourself, do we know how much data there is? Not specifically, but it is estimated at 2.7 zettabytes just in the digital domain, leaving out all paper. But, it grows at approx. 5 exabytes, per DAY. We are expecting, 35 Zettabytes of data to sell mining tools, into by 2020.
just like the population, it keeps growing. And like the population, there may be a tipping point where we begin to lose information, like we can adjust our birth rates. (well under way, as a matter of fact)
And almost all the information we have today is more like plain dirt. 80% of all information is unstructured. So, almost all the important stuff has to be sifted, like placer panning for flour gold. A data mine is like a hard rock, lode mine. Things are organized and we are not shifting tons of dirt to find one flake of information.
There is more opportunity in this world today than has every existed over all time, in my mind. Plenty of work for me, means plenty of work for you.
Just to give an idea.,,,, A gigabyte is 3 sets of Zeros. 1,000,000,000. A Zettabyte is 6 sets of Zeros.