Jump to content
ATX Community

Big Data meets Big Gov't: New IRS spy software


Max W

Recommended Posts

18 hours ago, FDNY said:

Can't believe anything those guys say.

I dunno; they have reported correctly in the past on government intrusions so I would not discount it out of hand.  Take everything with a small boulder of salt, and research from original sources oneself.  There is plenty of propaganda out there, masquerading as 'news', on all sides.

  • Like 2
Link to comment
Share on other sites

2 hours ago, Catherine said:

I dunno; they have reported correctly in the past on government intrusions so I would not discount it out of hand.  Take everything with a small boulder of salt, and research from original sources oneself.  There is plenty of propaganda out there, masquerading as 'news', on all sides.

Yes, I'm sure that's true.   And both sides either far right and far left irritate me.  I've got both in my family and friends and I'm ready to look for a big rock to get under.

  • Like 3
  • Haha 1
Link to comment
Share on other sites

Here is the article, cut and pasted. One thing to be aware of is that the author is known to use a lot of hyperbole.  However, the basic substance is correct as it comes from IRS Pub

 3744.    https://www.irs.gov/pub/irs-pdf/p3744.pdf

By Daniel J. Pilla

One of the reasons identify theft is considered by the Treasury Inspector General for Tax Administration to be the crime of the century is because of the IRS. The Internal Revenue Service makes growing demands for information about people’s businesses and private lives every day. There is no such thing as personal privacy these days. That the IRS sends citizens a so-called “Privacy Act Notice” in all its mailings is a farce. The IRS lays claim to your data without court authority more so than any other government agency. And to make matters worse, they share the data with any other federal, state or local government agency claiming an interest, including foreign governments.

A river of data

In 2019, there will be about 152 million individual tax returns filed with the IRS. There will be roughly another 100 million business tax returns filed. There will be millions more miscellaneous tax returns, including trust, estate and gift tax returns. On top of that, over 3.6 BILLION information returns (Forms W-2, 1099, etc.) will be filed. There is quite literally a river of data flowing into the agency. The flow cannot be stopped, and as far as the IRS is concerned, they need even more.

For example, one of the six “Strategic Goals” presented in the IRS’ 2018-2022 Strategic Plan is to increase its access to data, and use that data more effectively to drive its agency-wide decision making, as well as case evaluations and selections for enforcement purposes. See: IRS Publication 3744 (4-2018). This is consistent with the IRS goal of becoming a “data driven agency.”

The IRS is awash in data. The 2018-2022 Strategic Plan boasts that the IRS’ volume of data was 100 times larger in 2017 than it was 10 years prior. In 2018, the IRS Criminal Investigation unit alone collected 1.67 terabytes of data from various sources. A terabyte is 1,099,511,627,776 bytes, or 1,024 gigabytes of data. I’m told that approximately 900,000 plain text files can fit into a single gigabyte. The number of users in the IRS with access to that data has increased 23 times (Strategic Plan, p. 19) in the past 10 years.

Managing massive data

How do you manage, process and assimilate such a massive amount of data to the point where it becomes usable? The 2018-2022 Strategic Plan expresses the goal to “invest in analytics and visualization software and tools, and develop processes to support analytics in IRS operations” (p. 20). The end game is presented in these words:

Advancements in how data is collected, stored, accessed and analyzed will allow us to deploy data better. We’ll standardize our data processes and protocols and encourage collaboration among all IRS business units. Increased interoperability of data systems and sources will enhance the secure and seamless flow of data to enable greater authorized access to information. We’ll invest in training to develop more advanced analytics skill sets across the IRS, and use data to improve our business processes. (Strategic Plan, p. 19.)

The investment in analytics was recently undertaken – in a big way.

Big Government, meet Big Data

On Sept. 27, 2018, the IRS entered into a contract with Palantir Technologies of Palo Alto, California, to handle the task of data assimilation. The contract calls for Palantir to provide hardware, software and training to IRS employees to “capture, curate, store, search, share, transfer, perform deconfliction, analyze and visualize large amounts of disparate structured and unstructured data.” (IRS Contract Proposal, Performance Work Statement, Jan. 11, 2017, p. 1.)

Palantir is to build and train the IRS to use a unified supercomputer to:

search, analyze, visualize, and interact with a wide variety of disparate data sets so users will be able to leverage the platform to perform advanced analytics, such as link, pattern, statistical, behavioral, and geospatial analysis on an investigative platform that is scalable and interoperable with existing IRS equipment and systems. (Ibid, p. 2.)

What kind of data are we talking about? The contract proposal specifies the following data formats:

·        Oracle, MySQL, and PostgreSQL databases;

·        Delimited files (.csv, .dsv, .log, or .txt);

·        Excel files (.xls, .xlsx);

·        GraphML files (.graphml, .xml);

·        IVML files;

·        Email files (.eml, .pst, .mbox, .msg, .ost, .txt); and

·        PCAP files (.pca, .pcap, .pcp). Ibid, pg 20.

Ingesting massive amounts of data

The contract proposal states that the IRS is looking for an “analytical platform with a strong storage and indexing power allowing for rapid integration and analysis of ultra-large scale data sources.” (Ibid, p. 2.) Specifically, the system must meet the following criteria:

·        Allow for the rapid ingestion of massive amounts of data.

·        Users should be able to immediately use the imported data in the imported format to perform queries, analysis and identify links.

·        Allow users to drill down on massive amounts of disparate data to find connections.

·        Allow users to visualize connections from millions of records with thousands of links by grouping data visualization by the commonalities and roles. (Ibid, p. 20.)

This would allow the IRS to meaningfully link tens of millions of tax returns, billions of information returns, and trillions of bank and credit card transactions, phone records and even social media posts. For example, if a U.S. citizen moves money from a Swiss bank to some other offshore bank, then uses credit or debit cards to spend the money in the U.S., Palantir’s software can link those transactions. It could also flag a person whose tax return shows relatively low annual income but whose social-media posts indicate something entirely different.

This is exactly the kind of data analysis it will take to establish the IRS’ so-called “up-front tax system,” which I describe in my book “How to Win Your Tax Audit.” Under that system, the taxpayer is essentially removed from the tax preparation process because the IRS knows everything there is to know about your personal, business and financial affairs to the point where the agency prepares the return for you. How’s that for tax simplification?

The cost of spying

The IRS began working with Palantir in 2013. The agency spent $30.8 million on a five-year contract and granted Palantir access to files for more than 1 million people, according to a July 28, 2015, audit report. That contract provides the IRS with access to spy software for use by special agents (criminal investigators) “to generate leads, identify schemes, uncover tax fraud, and conduct money laundering and forfeiture investigative activities.” (Case Lead Analysis, PIA ID No. 1120, July 28, 2015, p. 4.)

Under the September 2018 deal, the government will pay Palantir $98,750,546.94 over seven years to fulfill the contract. My question is, why the extra 94 cents?

If the IRS’ $99 million spy software works as promised, the agency will have unprecedented ability to track the lives and transactions of tens of millions of American citizens


 

  • Like 2
Link to comment
Share on other sites

Remember when the IRS used to do lifestyle audits?  They could look at your income and see if it supported the kind of house you live in, the car you drive, the schools your children attend.  These were stopped when it was determined that they were essentially invasions of privacy.  Now they can do them only when they have reason to suspect a mismatch between income and expenses, e.g., you make $28k and have $20k in mortgage interest, or drive to the audit in a $90k car on your $28k income.  If you are under audit they can always subpoena your bank and cc records, but not before you are placed under audit.  This article suggests they will now pre-audit everyone.  I would like to say I doubt it, or that a similar ruling that put the kibosh on the lifestyle audit will come around, but the explosion in AI makes me no so sure.

  • Like 1
Link to comment
Share on other sites

On ‎1‎/‎25‎/‎2019 at 9:50 AM, Abby Normal said:

Some folks are born silver spoon in hand
Lord, don't they help themselves, oh
But when the taxman comes to the door
Lord, the house looks like a rummage sale, yes

John Fogerty & CCR

It ain't me, it ain't me, I ain't no fortunate one.

  • Like 1
  • Haha 2
Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...