BI Tools for Gerrymandering
This excellent article in the New Yorker describes the current redistricting process and the gerrymandering that's taken root. What I found intriguing is the use of Maptitude software from Caliper. By feeding as much census and election data as possible into the software it is possible to point-and-click your way to a safe congressional seat in the next election.
With a few clicks, Persily changed the map from one that showed party registration in each census block to one that revealed voting results in each block. The colors ranged from dark red, for heavily Democratic votes, to dark blue, for strongly Republican. He showed voting results in about two dozen races, from President to governor and from congressman to local offices. "The whole process has got much more sophisticated" Persily went on. "Party-registration data are not the only kind of data you want to use. You want to use real election results. That's a big change from ten years ago. We have become very good at predicting how people are going to vote. People's partisanship is at a thirty-year high. If I know you voted for Gore, I am better able to predict that you are going to vote for any given Democrat in a future election."The use of Maptitude reminds me of hospital third-party billing software I worked on in the mid-eighties. The primary, secondary and tertiary diagnoses determine the allowable rates for insurance and medicare billing. In general, one combination provides a better total rate than the others. It was illegal for a computer system to determine the ordering, but the person doing the data entry could. So they asked us to write software that would provide every combination of diagnostic combinations and display them with the billing amounts. The user then selected the one that had the highest rate. While not illegal it definitely subverted the intent of the laws at that time.
This redistricting software does much the same thing to the political redistricting process. There was never an intent to let politicians in power redistrict to make it impossible for them to lose elections. Now that computers and good BI software make it as simple to geryymander as clicking a mouse, we need laws that explicitly forbid this practice. The problem is that two thirds of politicians in office hold safe seats because of the subverted process, so it's unlikely that any reform will take place.
I'm also embarrassed to note that my own alma mater (CMU) played a part in the Pennsylvania redistricting that's now in court.
Posted by Mark Friday, December 05, 2003 8:16:00 PM |
I've only worked with MySQL a little bit, so I'm not yet comfortable using it for data warehousing. I'll be researching Open Source databases in a lot more detail this spring in advance of a few conferences, with the goal of pumping some large volumes of data into them to see how they handle the usual range of business intelligence tasks. Expect a spate of magazine articles come mid-summer on the results, assuming my hardware is set up in time.
While catching up on some reading I came across this useful page about some of the more annoying features of MySQL. The author doesn't treat these as bugs since they are mostly documented and designed-in behavior. For example:
If no DEFAULT value is specified for a column, MySQL automatically assigns one, as follows. If the column may take NULL as a value, the default value is NULL. If the column is declared as NOT NULL, the default value depends on the column type: (...)
I did not realize that MySQL's treatment of NULL values was as strange as the examples he walks through. Worth a read if you are starting to do any development work with MySQL (not just DW experimentation) and you are already familiar with commercial and standards-compliant databases.
Posted by Mark Wednesday, December 03, 2003 11:14:00 PM |
Excel for Visually Immersive BI
No BI tool is complete until it can handle sophisticated graphical applications. I've only recently learned that Excel makes the grade, thanks to this nice demo of a sophisticated GUI(291K).
Learning how to do this makes spreadsheets much more fun, sorta like when Visicalc was first released.
[Note: there has been some trouble downloading the file from it's original source so it's mirrored here just in case]
Posted by Mark 10:40:00 PM |