marksskram / SocrataOpenDataAnalysis / 0.1.6

Summary

This algorithm combines my first two algorithms: /marksskram/SocrataOpenDataQuery, which queries a Socrata dataset given a dataset ID, a domain, and a query (an empty query string is equivalent to SELECT *), and /marksskram/SomeStats, which calculates the mean, variance, median, and more.
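To make the "query, then summarize" idea concrete, here is a minimal Python sketch of the statistics step applied to rows that a query might return. It is not the code behind SomeStats: the function name `summarize_column`, the choice of population variance, and the decision to skip null and non-numeric values are all assumptions made for illustration.

```python
import math
import statistics


def summarize_column(rows, column):
    """Summarize one column of query results (hypothetical helper).

    Assumption: null, missing, and non-numeric values are skipped.
    The real algorithm's behavior for these values is undocumented.
    """
    values = []
    for row in rows:
        try:
            # Numeric values may arrive as strings, so convert explicitly.
            value = float(row.get(column))
        except (TypeError, ValueError):
            continue  # None, missing key, or text such as "n/a"
        if math.isfinite(value):
            values.append(value)

    if not values:
        return {"count": 0}

    return {
        "count": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        # Assumption: population variance; SomeStats may use sample variance.
        "variance": statistics.pvariance(values),
    }


# Made-up rows, shaped like a list of JSON records.
sample_rows = [
    {"salary": "100"},
    {"salary": "200"},
    {"salary": None},
    {"salary": "n/a"},
    {"salary": "300"},
]
print(summarize_column(sample_rows, "salary"))
```

Skipping unusable values keeps the sketch short; a stricter version could raise an error or report how many rows were dropped.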

See https://gist.github.com/marks/5c1c41df93b9cfa0f681 for a pretty-printed version of the JSON output for the sample below.

Feedback is always welcome. You can tweet me at @Skram or email me at mark.silverberg@socrata.com.

Example Inputs to Try

The top 100 White House salaries ("salary" column)

["open.whitehouse.gov","rcp4-3y7g","select * order by salary desc limit 100"]

City of Chicago owned property ("sq_ft" column)

["data.cityofchicago.org","aksk-kvfp","select * where ward = 20"]
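Each input above is a JSON array whose elements are the domain, the dataset ID, and the query, in that order. As a rough sketch of how such an array could map onto a Socrata request, the Python below builds a URL for the standard SODA resource endpoint with the `$query` parameter. How this algorithm actually issues its request is not shown in this README, so the helper name `soda_url` and the URL layout are assumptions.

```python
import json
from urllib.parse import urlencode


def soda_url(algorithm_input):
    """Build a SODA request URL from [domain, dataset_id, query].

    Hypothetical helper: assumes the public /resource/<id>.json endpoint
    and the $query parameter, not this algorithm's internals.
    """
    domain, dataset_id, query = algorithm_input
    url = f"https://{domain}/resource/{dataset_id}.json"
    if query.strip():
        # An empty query is treated as "select *", so no parameter is sent.
        url += "?" + urlencode({"$query": query})
    return url


example = json.loads(
    '["data.cityofchicago.org","aksk-kvfp","select * where ward = 20"]'
)
print(soda_url(example))
```

Fetching that URL and decoding the JSON response would give a list of row records to summarize.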


Known Issues
- Will probably break if the query does not use "select *"
- The handling of null and non-numeric values is not yet documented