Monday, December 27, 2010
The Primitive Pete
Saturday, December 18, 2010
Lying with Statistics...and PowerPoint
This doesn’t correct the problem, however, if the result being reported is, itself, incorrect because the analysis behind it was faulty. Still yet we must be reminded that it is extremely important in our search for the truth that we understand how it is very easy to lie with statistics and that we must always be on guard to keep deceptive or misleading results completely out of our activities. Further, we must avoid the unintentional pitfalls of the of graphical products such as PowerPoint that obscure our message, change our message, or become the message...because they will.
For today this essay is not about the mechanics of lying with statistics, the well known book, “How to Lie with Statistics” by Darell Huff covers this topic quite well. On the issue of Power Point, the books and publications about graphical presentations by Edward Tufte, such as “The Graphical Display of Quantitative Information”, pretty much cover the water front on that topic as well. Since in both cases, graphical presentation of statistics and PowerPoint are primary tools in our tool kit and will remain at our immediate disposal for sometime to come, what more can be done to extract truth?
The best way to avoid the pitfalls is to keep your perspective that you are the analyst. Tools and presentation products do not have the brains – you do. Long before you commit a number to a spreadsheet you should know exactly what trend you are trying to show or display. If you find yourself manipulating the data in the spreadsheet because you are looking for meaning in the data, you probably are beginning to walk on thin ice and should be wary. If the trend doesn’t just jump right off the page it’s possible that it’s not really there. This should be your first signal to start questioning what came before the spreadsheet, not what’s in the spreadsheet.
With regard to briefing slides, here again, you should have a pretty good picture in your mind with regard to the story you want to tell. If you let the story emerge while constrained by the software application, you are probably letting something creep into your message that you didn’t intend. When Marshall McLuhan said, "The Medium is the Message", he was serious about the underlying facts and content not being the message. The message is contained in the medium. Over and over again we find this to be the case. A well presented diagram of the facts trumps the facts in most cases. What this means if if you present a beautiful depiction of the relationship between two sets of data, it is the picture of this relationship that will be remembered in the mind of the decision maker. You better hope it was a correct relationship and not a misleading relationship. Time and time again we see presentations that show a wide gap between two alternatives. If the scale was changed, and the observer is not aware of the change in scale, they come away with a vision of separation that may or may not actually exist...the medium became the message. And the message was not necessarily true.
The best way to combat this is to decide on the truthfulness of your message first. This means do the thinking first and then use the tools at your disposal, not visa versa. In this way, the truth, as you will learn to discover, will not be obscured, influenced, or manipulated –either intentionally or unintentionally, by the external limitations some of which you were not even aware. Then your final product will be clear and logical and stand-up to the first wave of PowerPoint critics who enter the room...and those who leave the room will not be deceived.
Thursday, December 9, 2010
Essence of "Decision Points"
Jim's Review of "Decision Points" on Amazon
Saturday, November 20, 2010
What We are Taught
Thursday, November 4, 2010
The Ethics of Analysis
A car would be a better analogy. Clearly a car cannot be evil in and of itself (discounting the massive negative impact it’s life-cycle has on the environment). A criminal, as a new example, needs a getaway car, the driver of which may or may not be a witting participant. The car itself is but a tool to make a hasty escape. It is an inanimate and completely neutral object. The driver however, can choose to drive, or can question the passengers who hastily jumped on board with bulging satchels and what appears to be members of law enforcement in hot pursuit. If we as analysts did our job correctly, we would have obtained a fast car, we would have checked the traffic reports, we would have a good map with a planned route. We also would have a GPS for backup, a spare tire, and jack onboard. Wouldn’t want to be the get away driver feeling foolish asking for help on the freeway if a tire goes flat. In fact, we could do the very best planning and have taken great pride in our efforts only to find that we are simply in the midst of a serious crime.
How do we avoid this? Most bank robbers (sorry for the sterotype) want employees who don’t ask questions – coincidentaly most organizations want employees who are loyal and salute smartly – we call them team players. But here we find ourselves in one of the few professions where, in order for us to do our job to the best of our ability, we must ask many probing questions – questions that the boss might rather not answer. If you want the fastest travel time between the building on 17th street and the expressway, I recommend we drive the route before or after rush hour. But the bank is not open during rush hour – what a stupid question. Oh, I didn’t know you were going to be at the bank, I was just working on the fastest travel time. How long will you be in the bank, is there parking on the street just out front? During rush hour, a lot of people take the bus, it’s really hell being down town during those hours, can you take the bus? No. How about the subway? No. Why not? Because we will be in a Hurry? Why such a hurry, do you have to catch a plane? I wasn’t planning to drive you to the airport – but if that’s the case, perhaps a helicopter would be a better vehicle? I could pick you up in the park adjacent to the bank and pop you straight over to the airport.
Well that option is too expensive. I can’t afford a helicopter. OK, how much can you afford? Taxi drivers know the best routes and are the best drivers in the city at that time? Well I hired you to drive so I don’t want to hire a taxi cab? But clearly a taxi would be your best choice and since I want only the best for you I would advise you to fire me and plan on grabbing a cab when you come out. Well I need a driver who will take me any where I want to go? Yes, most cab drivers will. But I need the driver to be discrete? Ok, that is a new requirement; I was unaware of the discrete nature of your transportation needs. I can be very discrete – but I suspect so can a few cab drivers. But since you will not have control of exactly which cabbie you might hire, I think we can agree that I should drive. Yes I have already decided that you should drive – and I have already picked out a car and picked out the route for you to travel.
Oh, you just want someone to drive the car, not to help solve the problem, I’m sorry you just want a mindless, faceless, driver who doesn’t think. You’ve offered me a lot of money to drive this car for you, I think you could save a lot of money and hire a dumb driver for a lot less. Well I don’t want some incompetent ape driving the car – that would be worse than hiring a cabbie. Well that’s why I was trying to help, because I do think about the problems and typically can make the journey more efficient. What is it that you really are trying to do? We, I am robbing a bank and I need a get-away car.
Oh, why are you robbing a bank? If you need money and want to obtain it illegally, why not rob a convenience store, the get away is much cleaner. Not enough money in it for the risk. Ok, how about a grocery store? They have a lot of cash on hand. Well, I’m a bank robber, I’ve never robbed a grocery store. I wouldn’t know how to do it. Oh, well I’ve never robbed one either but I can definitely drive the getaway car. It would be much easier for me if the getaway is really independent of your action. But how about Internet fraud. If you really want money that is supposed to be a good way to get some – and it doesn’t require a get away car. I told you I am a bank robber – I would have to go back to school to learn how to use a computer – and you can’t teach an old dog new tricks.
Ok, looks like the bank is your only option. Have you considered robbing the bank at night when the streets are clear. Yes, but that requires that I hire a safe cracker – and have you ever had to work with a safe cracker. They are some of the hardest people to work with. They think the whole job is about them. Without them there is no bank robbery. I vowed to stop working with them a long time ago. What about tunneling into the bank and blowing up the safe. You don’t need a safe cracker for that. No you are right. Tried that before too. I need a bigger team – and the payroll on this job is tight. Can’t get the big boss to give me a bigger budget. Have you demonstrated that the rate of return on a safe job will be much higher than that of the daylight teller pull? Yes, he knows that, but he is expecting a few lower paying less risky jobs than a high risk, capital-intensive job. We can use you and your car on a couple of jobs the same day. Oh, multiple get-aways on the same day, this is a traveling salesman problem. I studied this one in school. Now I think I understand the problem completely and will optimize our route through town to hit all of the banks and then escape through the tunnel. Let me get started.
Friday, October 22, 2010
The Gauntlet
Sunday, October 17, 2010
Rest in Peace Benoît Mandelbrot
Friday, October 15, 2010
The Charlatan and Other Weasels
Saturday, October 9, 2010
The Knee Jerk Reaction of an Anti-Fluids Bigot
Thursday, October 7, 2010
The Truth About Profit
Thursday, September 30, 2010
The Zealot and His Disciples
Friday, September 24, 2010
The Empire Builder
Sunday, September 19, 2010
The Analyst: Bold--Gusting to Arrogant
Saturday, September 18, 2010
Frameworks for Understanding
Thank you Mooch for the opportunity to learn and share. Though I have long been an amateur thinker, the job title I have had for the past two years, "management analyst" asserts that I am now a professional thinker. Consequently, it is in my best professional interest to learn from my peers and avail myself to them, to the extent I am able. For now, I will not detail my background, because my current thought is that I want the ideas that I present to stand on their own. As you will see, they do require critique, maturing, and possibly expansion.
So, friends, without further introduction, here is one of my thoughts on "analysis." Analysis being defined as: seeking the truth of a particular matter with intent to understand in order to make useful decisions. To think productively, we need to understand the frameworks we use (one will not do, for reasons that appear intuitively obvious to me – let me know if you disagree; and to omit any one puts daylight between our analysis and reality/truth). Please let me know what other frameworks I have over looked. I will grant that they overlap, but I consider each a primary driver for how things work.
Frameworks:
Deterministic (because some things happen because another event caused them)
Random (because some things happen on a randomly distributed basis)
Chaotic (because some things happen on a non-random, non-periodic, unpredictable basis) Deliberate (because of free will)
Bias based (because we are human)
My goal is to increase in knowledge, understanding, and (some day) wisdom. Ogre
Friday, September 17, 2010
Lies, damn lies, and ...
“There is always a well-known solution to every human problem—neat, plausible, and wrong.” H.L. Mencken
On the subject of analytic and scientific truth…I haven’t entirely sussed out what this means to analysis, to analysts, and to me, but it leaves me questioning an awful lot of the things I’ve seen, read, and done. If nothing else, it leaves me with an even greater skepticism than I had when I woke up this morning.
I've been reading a little book titled Wrong: Why Experts Keep Failing Us--and How to Know When Not to Trust Them, by David H. Freeman. In this case, experts refers to “scientists, finance wizards, doctors, relationship gurus, celebrity CEOs, high-powered consultants, health officials, and more”—pretty much everyone who offers advice or conclusions in other words—and the book is all about the many and varied ways they (and we) get it…well…wrong most of the time. According to Freeman, we live in a world of “punctuated wrongness,” a world where, according to one expert (the irony here is intentional on my part and acknowledged on Freeman’s), “The facts suggest that for many, if not the majority, of fields, the majority of published studies are likely to be wrong…[probably] the vast majority.” This is a pretty stunning claim. In fact, if I think about this issue as a mathematician—the area of emphasis for most of my formal training and publication—I’m simply staggered by the claim. But my field is a little special I suppose, since “truth” (within the axioms) is pretty easy to spot. We may be the only discipline wherein one can actually lay legitimate claim to prove anything since ours is probably the only completely deductive intellectual endeavor. (That still doesn't mean we have any greater access to Truth, though.) In other fields of inquiry, the fundamental process is inductive—observe, hypothesize, observe, adjust, observe, adjust, etc.—and claims to proof are problematic in the extreme—which doesn’t stop anyone and everyone from using the phrase “studies show” as if they’re quoting from the Book of Heaven. But I also have a fair bit of training in statistics—both on the theory side and in applications—and one of Freeman’s explorations of “wrongness” really hit home.
Why do we use statistical methods in our research? Basically, we want to account for the fact that the world—as we observe it—is stochastic (although whether it is fundamentally stochastic might be an interesting debate) and ensure the measurements we make and the inferences derived from those observations are not (likely to be) statistical flukes. So, when we make a claim that some observation is “statistically significant” (not to be confused with a claim that something is “true”—a mistake we see far too often, even in our professional crowd) we mean there is some known probability—the level of significance—that we'll make a (Type I) mistake in our conclusion based on observing a statistical fluke. So, for example, a level of significance of .05 indicates (kinda sorta) a 5% chance that the results observed are the result of chance—and that our inferences/conclusions/recommendations are “wrong.” 1 in 20? Not so bad. How do we make the leap from there to “the majority of published studies are…wrong?”
As an exercise for the student, suppose 20 teams of researchers are all studying some novel hypothesis/theory and that this theory is “actually” false. Well, (very roughly speaking) we can expect 19 of these teams will come up with the correct ("true negative") conclusion and the 20th will experience a “data fluke” and conclude the mistaken theory is correct (a "false positive"). With me so far? Good. The problem is that this makes for a wonderful theoretical construct and ignores the confounding effects of reality—real researchers with real staff doing real research at real universities/companies/laboratories and submitting results to real journals for actual publication. Freeman has estimates from another set of experts (again with the irony!) indicating that “positive studies” confirming a theory are (one the order of) 10 times more likely to be submitted and accepted for publication than negative studies. So, we don’t get 19 published studies claiming “NO!” and one study crying “YES!” We see 2 negative studies and 1 positive study (using “squint at the blackboard” math)...and 2 out of three ain’t bad. (Isn’t that a line from a song by Meatloaf? I think it’s right before “Paradise by the Dashboard Light” on Bat Out of Hell. Anyway…) The other 17 studies go in a drawer, go in the trash, or are simply rejected. Cool, huh? Still…we don’t have anything like a majority of published studies coming out in the category of “wrong.” In the immortal words of Ron Popeil, “Wait! There’s more!”
Statistical flukes and “publication bias” aren’t the only pernicious little worms of wrongness working their way into the heart of science. “Significance” doesn’t tell us anything about study design, measurement methods, data or meta/proxy-data used, the biases of the researchers, and a brazillion other factors that bear on the outcome of an experiment, and ALL of these affect the results of a study. Each of these are a long discussion in themselves, but it suffices to say “exerts agree” (irony alert) that these are all alive and well in most research fields. So, suppose some proportion of studies have their results “pushed” in the direction of positive results—after all, positive studies are more likely to get published and result in renewed grants and professional accolades and adoring looks from doe-eyed freshman girls (because chicks dig statistics)—and suppose that proportion is in the neighborhood of an additional 20%. Accepting all these (not entirely made up) numbers, we now have 5 false positives from the original 20 studies. If all five of the “positive” studies and the expected proportion (one tenth) of the “negative” studies get published, we expect to see 7 total studies published, of which 5 come to the wrong conclusion. 5 of 7! Holy Crappy Conclusions, Batman! (Don't go reaching for that bottle of Vioxx to treat the sudden pain in your head, now.)
Freeman, following all of this, goes on to warn we should not hold science as a method or scientists themselves in low regard because of these issues. They are, in fact, our most trustworthy experts (as opposed to diet gurus, self-help goobers, television investment wankers, and other such random wieners.) They're the very best we have. Scientists are at the top of the heap, but “that doesn’t mean we shouldn’t have a good understanding of how modest compliment it may be to say so.”
CUMBAYA! It’s no wonder we poor humans muddle through life and screw up on such a grand scale so often! I need a drink, and recent studies show that drinking one glass of red wine each day may have certain health benefits…