.
.
Many people, who should know better, are in a state of denial regarding what the Foreign Intelligence data collection capabilities of the NSA really are.
Example:
The above are the wrong questions if you want to arrive at the truth.John wrote:
You have to apply a common sense test to articles like this.
* How much storage would be required to store every e-mail message
from every user in the world, including many spam messages?
* How much processing power would be required to analyze each
message for whatever keyword searches they're doing?
* Same with online chats and all the other data?
* How much disk storage would be required to store all that?
* Assuming that the mean time between failure of disks is a
couple of years, how would the issue of backing up all this
data be handled?
* How many computers would be required to do all this
processing?
* How much network bandwidth would all this suck up?
* How big a staff would be required to keep this facility up?
* How many hardware and software vendors would have to know
what was going on?
...
Some Examples of the questions that should be ask, and then ground checked for truth:
Would it be more Band Width than currently exists on the Internet ?John wrote:
* How much network bandwidth would all this ( this being Filtering all the data moving on the Internet ) suck up?
Is the World Wide Internet Architecture topology a distributed network of equal size data pipes, or is it a hierarchical structure with distinct massive backbone pipes ?
Are there a number of collocated backbone routers ( at a relatively small number of collocation communications sites ) that interconnect the different proprietary Backbone pipes ?
Are there massive numbers of dark ( unused ) fibers running between many of the collocated backbone inter tie locations ?
Are there also large numbers of dark ( unused ) fibers running in parallel to many of the proprietary backbones ?
Given the answers to the above ( and your desire to avoid duplicating the entire internet ) where would you locate the data storage locations where the "soft data" filtered would be stored for a few hours or a few days ?
Does this ground truth against the sever locations in the top secret data leaked by the CIA / NSA traitor ?
How do you design databases to avoid searching through every character in every record stored in a database to find the text string your are looking for ?John wrote:
* How much processing power would be required to analyze each message for whatever keyword searches they're doing?
What is the schema you would use to structure the template you would use to filter the data being filtered to analyze it ?
Where would you get that schema ( packet structures ) necessary to create a template for the data stream you are capturing and storing ( for whatever short period of time ) ?
How would you index the data stream you are filtering so that you could use meta data obtained from a different source to look at just the records in your data base related to the packets in the stream you have both stored and are interested in ( so you do not need to do keyword text searchers of every record in the database).
Where would you get that meta-data ?
How would you index the the indexes so you could tie data coming from a workstation computer, telephone switch or communications server ( such as an SMTP server, or a Instant Messaging server, or a Face Book Server, etc. ) to one or several records of the steam ( soft data ) you temporarily stored in your database ?
Where would you obtain the Meta Data ( Hard Data ) you needed to create the Index of the Index ?
How would you index the index of the index to tie a specific telephone number ( or a specific e-mail address, or a specific face book user, etc.) to the index of a specific telephone switch, or specific work station, or specific server ?
Where would you obtain the Meta Data ( Hard Data ) you needed to create the index of the Index of the index?
How would you generate other third level indexes on the fly by associating patterns in the relations ship of stream records associated with the same first and second level indexes during the same time period they are associated with related third level indexes you are interested in ?
Does this ground truth against the pattern of leaks, confirmations and non-denial denials of the leaks, as they regard the collection of Meta Data ( index information ) from various technology companies ( the leaks, confirmations and non-denial denials, having occurred over the last few weeks and months ) ?
What is the total amount of data stored on hard disks in online servers, online computer workstations, telephone switches, and other online devices at anyone time ?John wrote:
* How much storage would be required to store every e-mail message ( or just the text portion of the e-mail message ) from every user in the world, including many spam messages?
* Same with online ( text ) chats ( and text instant messaging, and skype text messaging, and face book text messaging ) and all the other ( text ) data?
* How much disk storage would be required to store all that?
What percentage of that is text data? Music ? Digital Voice Data ( like telephone calls ) ? Low Quality Video ( like the old National Broadcast TV standard ) ? DVD quality video ? High Definition Blue Ray Quality video ?
What percentage of that total data stored on "online disks" travels across the internet in an hour ? in a day ? in a week ? in a month ?
What is the percentage of Internet content is text data ?
What form of data storage get's cheaper and cheaper per Giga-byte as the amount of data stored scales larger and larger ?
What form of data storage becomes virtually immune from hard disk failures as a form of data loss as the amount of data stored scales larger and larger ?
What is the acronym for Redundant Arrays of Inexpensive Disks ?
Who created RAID Systems and when were they created ?
Who funded, and continues to fund, the development of ever more capable, ever larger, ever more efficient, and ever more robust ( immune from performance degradation due to the failures of individual disks ) Raid designs ?
Given what a tiny percentage of all the internet content data in the world is text data, sent as messages during any given period of time, does the original question have any meaning related to the NSA ?
Did the below linked presentation even claim that ALL Internet text content data ( text soft data ) was stored for years, let alone decades - or forever ?
http://www.theguardian.com/world/intera ... esentation
I believe the above is more than enough examples.