Posted by Jay Livingston
I used to say, “The data are.” Pretentious I know. But no more. Now I’m a “the data is” kind of guy.
I’m not alone. Here’s the chart from Google n-grams, which also shows that we’ve become steadily more data-conscious.
When we talk about “the data,” we are referring to a whole -- a large thing made up of lots of smaller similar things. The word data is plural only in the most technical sense – it’s plural in a foreign language. The trouble is not that the language is foreign or that nobody speaks it. The problem is that data is a plural of a word that in English has no real singular. Nobody talks about a datum. When we select a particular instance in our data, we call it a “data point.”
It’s like spaghetti, another plural word in a foreign language. Spaghetti refers to a lot of similar things all combined to create a whole thing, a dish. We speak of that ensemble as a singular thing. We don’t say, “The spaghetti are delicious.” If we were speaking Italian, then yes, we would follow Italian grammar and use the plural “Gli spaghetti sono deliziosi.” And in Latin we would use the plural conjugation for data. But we’re speaking English.
----------------------
I have two Italian cookbooks on my shelf – gifts from people who thought my Italian is much better than it actually is – but I’m not going to try searching for something that probably is not there.
No comments:
Post a Comment