Hey all!
I’ve updated the document to include SEC insiders. This data is a bit
tricky because it pulls for all entities that have a Central Index Number
(CIK#), which include businesses. So, while I have cleaned it to just
include individuals, there are still some businesses within the document.
Over time it is something I hope to have cleaned up entirely.
As usual, I won’t post the link here so please email me for the direct link
(*for those who downloaded the original document, the link is now changed
due to the wonkiness of Google Sheets*). Also, I’m not in the office today
and tomorrow, so I will get back to everyone’s request for the link by
Friday morning.
*SEC Insiders*
*Culled from every entity that has a Central Index Number (CIK#) from the
SEC registry. Document contains some businesses (<5%) due to data cleaning
errors. Some dupes remain. 230,508 records*
p.s. Next up is patent data, which may take me a bit.
On Tue, Jun 4, 2019 at 9:00 AM Steve Grimes <
grimessjr@gmail.com> wrote:
> Hey everyone! I posted the below over on PRSPCT-L but I follow this
> listserv also, and I figured it would be of use to those who don't follow
> PRSPCT_L.
>
> Hey all!
>
> So, I’ve been working through open data sources over the past couple of
> months. For those who don’t know, you can think of open data as public data
> repositories that everyone has access to. Some of these include things like
> political giving contributions, business ownership, property valuations,
> etc.
>
> At my org, we have a limited budget for screenings and I’ve been using
> open data as a workaround to screen everyone in our database. I have a
> workshop coming up with APRAGNY in July and a webinar with NEDRA in
> September to go through my workflow with these data sources. In both
> presentations, I will present how I use Excel to parse through and match
> records with some of the other creative things you can do with this sort of
> data to supplement or replace your current screening setup.
>
> However, before then, I would like to share some of that data I have been
> collecting with anyone who might find it useful for their purposes. Because
> there might be a gray area here with PRSPCT-L guidelines
> <https://www.aprahome.org/page/prspct-l/guidelines>, I will ask that you
> email me for the document (
sgrimes@jazz.org) and I will provide you a
> link for download.
>
> Below is a description of what you will get in that file (FYI, it is
> somewhat NY based, but there is data here that anyone can use no matter the
> state you’re located in). Also, if you are already using R or Python, you
> will know how to use that software to match the records in your data base.
> However, if you are not familiar with those tools, you will need Fuzzy
> Lookup <https://www.microsoft.com/en-us/download/details.aspx?id=15011>
> to match records in Excel (I will be covering all of this in my
> presentations).
>
>
>
> *Open Data Wealth Data Set*
>
> *World’s Billionaires*
>
> *Fields are explanatory; 2,656 records*
>
> *2019 NYS Political Contributions*
>
> *Fields are explanatory; Worksheet contains only individuals who have
> given a one-time contribution of $1000 and above; Dupes removed; 3,041
> records*
>
> *2018 NYS Political Contributions*
>
> *Fields are explanatory; Worksheet contains only individuals who have
> given a one-time contribution of $1000 and above; Dupes removed; 12,111
> records*
>
> *2017 NYS Political Contributions*
>
> *Fields are explanatory; Worksheet contains only individuals who have
> given a one-time contribution of $1000 and above; Dupes removed; 14,501
> records*
>
> *2020 Federal Political Contributions*
>
> *Fields are explanatory outside of “Company” and “Position” which lists
> where a person was employed at the time of contribution (self-reported);
> Worksheet contains only individuals who have given a one-time contribution
> of $5,000 or above; Dupes removed; 7,154 records*
>
> *2018 Federal Political Contributions*
>
> *Fields are explanatory outside of “Company” and “Position” which lists
> where a person was employed at the time of contribution (self-reported);
> Worksheet contains only individuals who have given a one-time contribution
> of $5,000 or above; Dupes removed; 52,495 records*
>
> *2016 Federal Political Contributions*
>
> *Fields are explanatory outside of “Company” and “Position” which lists
> where a person was employed at the time of contribution (self-reported);
> Worksheet contains only individuals who have given a one-time contribution
> of $5,000 or above; Dupes removed; 61,243 records*
>
> *NYC Property Valuations*
>
> *City valuation of property; “Owner” contains companies and individuals;
> “FULLVAL” includes properties valued at $1M and above; Dupes removed;
> 76,898 records*
>
>
>
>
>
>
>
>
>
> *Steve Grimes Director, Development Analytics and Strategy Jazz at Lincoln
> Center Venue / Frederick P. Rose Hall / Time Warner Center, 5th Floor
> Offices / 3 Columbus Circle, 12th Floor, New York, NY 10019
>
sgrimes@jazz.org <
sgrimes@jazz.org> / jazz.org <http://jazz.org> P 212 258
> 9985 F 212 258 9900*
>
>
--
Steve Grimes
718.795.8026