Cold Spaghetti :: The one where Holly complains about statistical packages and software in general

{ 2007 10 26 }

The one where Holly complains about statistical packages and software in general

Big stresses are not what will finally, in the end, drive me to insanity. It will be the culmination of the little things that finally pushes me off the ravine upon whose edge I live so perilously close.

These little things, things that should be easy but somehow aren’t, are most commonly presented in my life in the form of Stupid Programing Issues made in the statistical packages with which I am sometimes called to use for my work. Most of these packages (STATA, SPSS, SAS) are incredible expensive and call for both the privilege of access to them and then climbing the learning curve required to know how to use them. (Understanding what those numbers actually mean is a serious problem within the sciences themselves… poorly trained social, behavioral, and medical scientists, lack of good theory and critical thinking in higher education programs… I could go on, but this rant is about software.)

There is one free package, made by the CDC. It is what statistical software should be in public health: free, amenable to operation in older systems, fairly easy to learn, have access to standard comparisons of health/nutritional indicators. It is commonly used in the field for the reasons listed above. But it is not particularly powerful in analysis. Although it is not hard to manipulate for basic information when you’re set up in it, getting to the point where your data is in the system and correct is difficult — the software is not user friendly. Paul keeps telling me that we should write a grant to fund him making a new package. Something that would offer more statistical power but be a bit more intuitive in its interface. This does not sound like a good idea. If we did this and people found out, I fear our doorstep would be darkened daily by strung-out graduate students seeking vengeance.

And right now, I’m beating my head against the wall because the damn thing won’t read my run files. Don’t get me started on how many times this thing has crashed. It is insisting on a click-by-click dummy entry of things and just making my life really suck. It would take all day to recode half of these variables without a run file. I am ranting only because I decided to chuck the version and download an updated package and need a vent while I wait. I could do these things in STATA so much faster, and have considered changing the data and studying it elsewhere and then loading it back to Epi Info, but I think I need the practice here. I’m trying to write a lab assignment and need to be able to test that these things work before unleashing Master’s students on it (they are already nervous and unsure about the whole “lab” thing).

What is really firing me up is that these are REALLY SIMPLE sort of things I’m struggling with. User error is always a factor, although the same command that works in one second is full of syntax errors in the next. Wa…? Unfortunately, the “HELP” aspects of the software are miserable.

Which brings me to a point: if I am having problems here, in my cushy home, with all the resources around me I need to figure out a solution, how can we rely on this in the field? Shouldn’t a profession like public health, with such important implications in surveillance and data survey, have truly excellent caliber software — free for use — and widely available? And if Epi Info is our current solution to this issue, then for heaven’s sake… why isn’t it available for Mac OS (in the least) or Ubuntu?

Anyone out there a big data geek who knows Epi Info and is ready for some questions???

{ 3 }

Comments

laloca | 26-Oct-07 at 11:36 am | Permalink

uff. i’d love to help, but i’m an SPSS user myself. the one time i tried epi info – nearly a decade ago, now – i absolutely loathed it.

one minor thing… your US bias is showing. “the States” vs. “the field”?
Cold Spaghetti | 26-Oct-07 at 11:55 am | Permalink

What, you don’t like having to consult a history file to see the results on a simple FREQ? Arrr. The newest version came out a few days ago but the download link is broken…? So after uninstalling and all that nonsense, I had to install the same version I already had. All things considered, it really isn’t a horrible basic package. Particularly if the folks using it aren’t whizzes with Epid and Biostats (like the MPH students I’ll be teaching this to).

I’m not nuts about the way SPSS handles hierarchal data and like having the flexibility in STATA for different types of modeling (probit, tobit… etc.) It’s not great for graphs. (SPSS makes prettier pictures.) Just depends on the level of analysis you’re going for in your study, I guess. Biostats folk swear by SAS, which was what we used in lab at UM. It may also be a trained response as my best coursework used STATA and the sucky ones used SPSS.

Hmmm. States, field. Yeah, good catch. Shows how fast my mind was pouring while venting. Interesting that by “field” I really meant “without internet” — as the very data I’m hammering on was entered “in the field” in emergency shelters after Hurricane Katrina. (This is what I was thinking about really — how if you were using this to track immunization levels or chronic conditions and needed the info right away, as you would in a post-disaster scenario, problems with the software would be a major bummer.) An interesting slip, though, as one could argue that Southeast LA is, resource-wise, better compared to more vulnerable areas of the planet than with “the States.” Apparently, my subconscious is turning to the dark side where the world is suddenly divided into a 49 State country (minus Louisiana, of course) and then the rest of the world… a “field” of roting vulnerability and need. Wow. My transformation into the future international health worker must be complete! (Kidding. I’m not quite that jaded, yet.)

Now that you pointed it out, though, it will continue to bug me, so I’ll fix it.
laloca | 29-Oct-07 at 7:13 am | Permalink

My transformation into the future international health worker must be complete!

heh. my mental image upon reading this was of you emerging from a chrysalis of poorly-annotated data files, slide rule in one hand and AID grant applications in the other.

not that anyone uses slide rules anymore. except perhaps my mother.

my spss bias is really easy to explain: i’ve used it since 1988. yeah, nearly 20 years, now. no wonder i went to law school. (hunh?) the latest version i’ve seen (for the mac) is so different from the mainframe version i ran my bachelor’s thesis data on, i hardly recognized it.

i also used shazam, an econometrics package, way back in the day. don’t remember much of it, though.

the danger of any of the stats packages, though, is use by a person without sufficient statistics understanding. it’s easy to run completely meaningless tests. if i didn’t have all my textbooks still hanging around, i’d probably make that mistake myself.

Cold Spaghetti

The one where Holly complains about statistical packages and software in general

{ 3 }

Comments

Post a Comment

Home

Main Course

Cold Spaghetti

About Cold Spaghetti

Speak to the Chef

Featured Dishes

Friends & Family

NOLA noteworthy

Notable and Newsworthy

Supportively Stalking

Menu

Order Up

Nods to the Kitchen

Archives

Cold Spaghetti

The one where Holly complains about statistical packages and software in general

{ 3 }

Comments

Post a Comment

Home

Main Course

Cold Spaghetti

About Cold Spaghetti

Speak to the Chef

Featured Dishes

Friends & Family

NOLA noteworthy

Notable and Newsworthy

Supportively Stalking

Menu

Order Up

Nods to the Kitchen

Archives

A-la Carte