Thursday, September 21, 2023

Data is Not Greater. Part 2.

There was a book review in the cheery-neo-fascist Wall Street Journal over the weekend that picks up on my post from yesterday and I think, is well-worth reading. Because the Journal has a paywall that is harder to violate than a Pennsylvania prison, I'll paste the article at the end of this post. 

I believe in supporting journalism and paying for subscriptions. But I'll gladly steal from the Murdoch family. I wouldn't mind if they were somehow tortured like Prometheus bound. That would make me smile and gloat like one of their so-called newscasters. However since Rupert can't be chained to a rock to have his gizzards eaten each day, I'll have to content myself  with pirating his content and making it available.

Yesterday, I wrote a post about athletes giving up their data-driven running watches because the mania around data was destroying their love of their sport. Today, I'm writing a post about a book called "Optimal Illusions." 



To my dumb eyes, it makes at least four points.

One: We should beware the "deification of data."

TwoOptimization is a mathematical procedure purported to be rigorously objective and free of biases. But the mathematical analysis is often divorced from reality. 

Three Systems only work fantastically, when everything’s working fantastically.

Most important,

Four: The basic skills needed to manipulate large datasets are taught everywhere today, but few students receive much grounding in the limitations of the data or the ethical implications of their work.

I remember working on a direct marketing campaign thirty years ago for a luxury German car. The data the company received from consumers presupposed that car-buyers based their buying decisions based on one of five qualities: value, safety, performance, comfort and luxury. Their direct responses were tailored based on which single attribute the customer had chosen.

I think a lot of "analytics," even today, is similarly marked by a lack of sophistication. My sense is that buying decisions are made by a complex and ever-changing web of factors. Trying to identify or isolate those factors, or rank them, or tailor marketing to them is all-but futile.

If I were to fill out a survey, I'd answer that I like tall blonde women. But I've been married to a short brunette for nearly 40 years. In fact, my head's been often turned by a variety of shapes, sizes and hues. That doesn't make me a letch, or non-discriminating. It makes me human. 

I'm not out of step with the majority of data, the data is out of step with the majority of humans.

Finally, I'm not anti-data.

I'm anti the "deification of data."

One of my favorite writers, Mark Harris once said, "the only hero is the man without heroes."


In other words, don't wholly put your trust in anything or anyone. Everything and everyone, even the most exalted have flaws. That's even true for me.

Or so I'm told.

----

‘Optimal Illusions’ 

Review: Efficient to a Fault

A mathematician makes the case that our obsession with data and algorithms leads us to miss the bigger picture.

Sept. 17, 2023 2:46 pm ET

image
PHOTO: ALAMY STOCK PHOTO

The logistics business is a creature of mathematics. Algorithms dictate the most efficient plan for unloading boxes from a container ship, stacking them in a freight yard, managing the truck fleets that transport them to distribution centers and delivering their contents to millions of households. Yet when international trade in goods surged during the Covid-19 pandemic, all those algorithms failed to keep supply chains from becoming seriously tangled. Their dictates, it seems, were no longer optimal.

In “Optimal Illusions,” mathematician Coco Krumme explores why the quest for the best so often falls short. The hidden assumption behind optimization, she notes, is that all relevant variables can be identified and accounted for in order to achieve the ideal outcome. Yet unknowns and uncertainties can upend the most carefully laid plans. “Optimization was invented, over time, with the idea of making a system, or a society, more perfect,” she writes. But “we often see the opposite happening.”

Optimization, Ms. Krumme asserts, has arisen from three shifts in how humans conceive of the world. The first shift is an increasingly atomized understanding that leads us to see individuals as assemblages of tiny bits of data. The second is a tendency toward greater abstraction, especially in computer modeling. The third is automation, which changes scale, devaluing the local and the particular. Over time, Ms. Krumme contends, these shifts have led us to understand reality in new ways—and to believe that we can perfect it.

Optimal Illusions: The False Promise of Optimization

By Coco Krumme

Riverhead Books

272 pages

drastically reduced the number of people facing starvation. Eventually, however, it had unforeseen consequences, such as the displacement of diversified family farms by bigger operations growing a single crop with large doses of fertilizers and herbicides.

“The deification of efficiency cast aside traditional methods of letting soil rest and regenerate,” Ms. Krumme writes. “Its focus on increasing yields divorced agriculture from farmers, and farmers from land and place.” Rural communities eroded. Small processors died out. And the surviving farmers, while more prosperous most of the time, became more vulnerable to price swings because their incomes depended on the value of a single crop.

Optimization, Ms. Krumme says, also lies behind the breakdown of the electric grid in Texas during the frigid winter of 2021. The grid sits atop a mountain of algorithms that project power demand, forecast the output of individual generating plants, and allocate energy based on the prices customers and suppliers are willing to pay and receive. “It’s a system optimized to work fantastically, when everything’s working fantastically,” she observes.

But optimizing investment had squeezed out excess generating and transmission capacity, while the models hadn’t accounted for a prolonged spell of sub-freezing temperatures. When customers turned up the heat, the electricity they counted on was to be had only at extortionate prices, if at all. What was optimal in certain circumstances turned out to be suboptimal when the weather changed.

What’s the problem? Optimization is a mathematical procedure purported to be rigorously objective and free of biases. The basic concepts are taught in every calculus class. A simple computer program can do the heavy lifting. “We start off with what we imagine is a blank slate,” Ms. Krumme writes. “We throw in piles of data and let the computer code spit out the ‘best.’ ”

But the mathematical analysis is often divorced from reality. Constraints may be ignored, misunderstood or forgotten. The specific knowledge possessed by individuals who know the idiosyncrasies of a machine or a market is disregarded. And the slate on which a model is built may not be blank at all. While genetically modified crops may be optimal in terms of tons per acre of farmland, how does one weigh the loss to growers who wish to continue planting traditional varieties but cannot do so when the local farm co-op no longer wants to purchase and store what they’ve grown?

Moreover, optimization cannot readily be undone. The Colorado River Compact of 1922 optimized the distribution of water based on the flows and consumption patterns that prevailed at the time, but it became far from optimal once average precipitation declined, Mexico negotiated a share of the river’s flow, and population growth in places like Los Angeles and Las Vegas vastly increased the demand for water.

Ms. Krumme is aware that optimization techniques have brought enormous benefits. Her complaint is that they have also “come with trade-offs: the loss of slack or downtime, the abstracting away of place and particulars.” They have “led to a loss of human scale, making events like the Texas-grid breakdown more frequent.” One reason why, she laments, is that over time the underlying assumptions behind algorithms fade into obscurity, and certain basic questions are no longer asked.

At times, Ms. Krumme veers toward the romantic, longing for a simpler past. Some readers may find her jabs at “efficiency’s overreach” off-putting. Nonetheless, her warning about quantitative excess deserves attention. The basic mathematical and computer skills needed to manipulate large datasets are taught everywhere today, but few of the students who learn these skills receive much grounding in the limitations of the data they are analyzing or the ethical implications of their work. Ms. Krumme’s provocative, brisk and refreshingly nontechnical book offers an important corrective.







No comments: