Lisa Gold: Research Maven

Entries categorized as ‘Amazon’

Time-traveling through the English language with the Historical Thesaurus of the Oxford English Dictionary

October 27, 2009 · 7 Comments

Matt and I recently had the opportunity to spend some time browsing through the Historical Thesaurus of the Oxford English Dictionary, which has just been published by Oxford University Press.

I began by reading the introduction. He began by looking up curse words. Once he had satisfied his curiosity about when certain very popular profanities first entered the English language, he turned his attention to the more unusual words within the inferior persons, as abused subcategory that have fallen out of use, such as windfucker (1602 to 1616), hog-rubber (1614 to 1621), chuff-cat (1653), shit-sack (1769 to 1785), and son of a sea-cook (1806 to 1977). This led to an animated discussion of the common themes that connected many of the words—comparisons to animals, sex with relatives or objects, and the inability to control one’s bowels.

That’s what happens when you put the Historical Thesaurus of the Oxford English Dictionary within reach of a writer.

Browsing this work feels strangely like time-travel. All the words from Old English to 2003—obsolete and current, including slang and dialect—have been extracted from the Oxford English Dictionary and organized by their meanings and dates of use. This places each word within its historical context, revealing how ideas and meanings emerged and the different ways they’ve been expressed through time.

It took forty-four years to bring the HTOED to publication, overcoming what the editors politely describe as “a series of intellectual, financial, and domestic challenges.” About 800,000 meanings from the OED were transcribed onto slips of paper and organized into a unique classification system with over 236,000 categories and subcategories. A fire in 1978 would have destroyed a decade of work but for the fact that the paper slips were stored in a metal filing cabinet. They could have finished making slips by 1980, but the decision was made to add new material from the second edition of the OED and the supplements. Computers were eventually used to enter, store, and retrieve data, but much of the work continued to be done by hand.

The result is the world’s largest thesaurus, nearly 4000 pages of small type in two big volumes weighing fifteen pounds, with a slipcase and folding chart of the top levels of the classification system. I like print references because browsing can lead to serendipitous discoveries, but these books can be awkward to use. It’s especially frustrating when looking up a word with multiple meanings, as the index may list dozens of identification numbers, which means lots of page flipping. No, it’s not available online or on CD, though that may eventually change. I’d like to see the powers-that-be at Oxford University Press quickly add the HTOED to the online OED so both works can be used together and fully cross-referenced and searched.

The classification system of the HTOED is mind-bogglingly complex, forming a hierarchy of meaning from the general to the specific. At the highest level are the three main sections—the external world, the mental world, and the social world—which divide into 26 major categories, such as the earth, life, emotion, society, morality, faith, armed hostility, and communication. These branch into more detailed categories like food, clothing, people, animals, transport, love, moral evil, and sexual relations. More specific categories and subcategories lead to the synonyms and related words, which are organized by part of speech and listed chronologically with the date of the first recorded use in English and, for obsolete words, the last recorded use. (I recommend reading the “guide to the use of the thesaurus” to get your bearings.)

Each level in this hierarchy of meaning is assigned a two-digit number, which when combined creates identification numbers for every word in the thesaurus. Some words have many identification numbers because they have numerous meanings or have changed their meanings over time and thus appear in different locations within the thesaurus.

For example, in the alphabetical index, the first identification number for the noun serendipity, one of my favorite words, is 01.05.05.10.02.01|10.01, locating it in the thesaurus within these nested categories and subcategories:

01                                                         the external world
01.05                                                  existence in time and space
01.05.05                                           action/operation
01.05.05.10                                    endeavour
01.05.05.10.02                             searching/seeking
01.05.05.10.02.01 (n.)              finding/discovery (noun)
01.05.05.10.02.01|10               accidentally (subcategory)
01.05.05.10.02.01|10.01        faculty of making happy discoveries by chance

Here you’ll find that the noun serendipity was first cited in 1754. After the finding/discovery (noun) category is the finding/discovery (adjective) category, in which serendipitous (01.05.05.10.02.01|03) dates from 1958.

The HTOED will clearly be important to the study of the English language, but it also could contribute to other subjects, especially history, literature, and culture. The descriptions of life and the earth over centuries are like crash courses in the history of science and medicine. Cultural historians will look for clues in our language to our attitudes about gender, race, and class, as with the words used to describe women based on animals (mare, hen, cow, heifer, bird) or clothing (skirt, smock, petticoat). Advancements in technology are reflected in subjects like travel, tools, telecommunications, and computing. Shakespeare scholars will be able to compare the words in use during his lifetime and argue about the reasons for his word choices. Even a category like clothing can reveal shifts in morality, as when underwear became unmentionables in 1823.

I believe the HTOED could be a rich source of inspiration and world-building for writers. Historical novelists could gain insight into the past and how people lived, what they knew and believed, and how they described their own world. And they’ll know whether the words their characters are speaking were actually in use at the time. (Elizabethans would not have called a packed meal a picnic, as it was first cited in 1748.)  Fantasy writers may unearth ideas in forgotten names or descriptions of supernatural beings and mythical creatures. Poets can reintroduce lyrical and imaginative words that have fallen out of use, such as candel (Old English to 1634), luminair (1456 to 1560), or streamer (1513 to 1647), all of which once described heavenly bodies. Eclectic writers like my husband who have a strong love of word-play and enjoy collecting unusual bits of knowledge will find it addictive.

Let’s say you’d like to take advantage of the current craze for vampires or literary monster mash-ups like Pride and Prejudice and Zombies. The HTOED can tell you when different monsters first entered our nightmares and what we called them at distinct points in time. Follow the hierarchy of categories from the external world to the supernatural to supernatural being/spirit to malignant monster (noun). Here you’ll find that the word vampyre was first cited in 1734, followed by vampire in 1796. Though vampire is still in use today, the last recorded OED citation for vampyre was in 1847. Referring to vampires as undead didn’t begin until 1897. Werewolves trace all the way back to the Old English werewulf, lycanthrope was first cited in 1813 and is still in use, but the more poetic turnskin entered the language in 1831 and exited forty years later. Oh, and zombie was first cited in 1819, two years after the death of Jane Austen.

The editors have included all those words that have been too controversial for some other dictionaries and thesauruses. Curse words, sexual slang, and offensive slurs for racial and sexual minorities appear dispassionately in their chronological place among their less inflammatory cousins. Reading certain entries may cause shock, disgust, or pain, but there is value in putting these powerful words in their historical context. If you are easily offended or prefer your works expurgated, consider yourself warned.

Priced at $395 (on sale at Amazon for $316), the HTOED will unfortunately be out of reach for many of the writers and word lovers who might appreciate it, so keep it mind if you are looking for a fabulous gift for your favorite logophile.

For more information, check out this OUP website for the HTOED and this OUP blog post with “fun facts and figures” about the work. Here’s the link to a sample page from the work at the OUP website.

UPDATE, 10/28: I received an email from Christian Kay, editor of the HTOED. There are indeed plans to eventually link the HTOED to the OED online and make it available to subscribers, but that could be a couple of years away. There are no plans for a CD version. So it looks like the books will be the only option for quite some time.

Categories: Amazon · Authors · Books · Browsing · Dictionary · Fantasy · Historical fiction · History · In the news · Reference books · Research · Science Fiction · Searching · Shakespeare · Thesaurus · Words · Worldbuilding · Writer's Bookshelf · Writers · Writing

Unlike Jeff Bezos, I love the physical book

June 18, 2009 · 1 Comment

I live my life surrounded by books. My husband and I have thousands of them, old and new, in bookcases covering the walls of nearly every room of our house.

Our books are more than just texts. They are artifacts that express who we are and what’s important to us. They are time capsules that can take us back to a particular memory or moment in time. They are symbols of our relationships– with each other, with friends, and with the authors who inscribed their books to us. They are unique, collaborative works of art, a marriage of ideas, language, typography, illustration, and design.

Yes, physical books are heavy and sometimes awkward to handle in bed, but they do have certain advantages over ebooks. They can be read anywhere, anytime, without a special device– no worries about breaking or losing your reader or the batteries dying. There are no problems with formatting, DRM, technological obsolescence, or preservation for future generations. When I am finished reading a book, I can give it to my husband to read, donate it to my local library, or sell it and use the money to buy more books. When doing research, I can have multiple books open in front of me at the same time and easily browse through them.  (Browsing is very different from searching, and it often leads to unexpected and valuable discoveries. Format has an influence on the reading experience and the way we find and process information.)

I am not anti-ebook– I would love to have a Kindle DX if I could afford one. It would be a pleasure to travel with a Kindle instead of bag full of books, or to be able to download digital books instantly. But ebooks could never completely replace all of my physical books. I can’t replicate on a digital reader the experience of browsing through a facsimile of Shakespeare’s First Folio or a book with beautiful illustrations or photographs, or reading a colorful children’s picture book with my niece and nephew. I wouldn’t be able to share books I love or useful reference works with Matt unless we both have readers and there aren’t DRM restrictions on the works. And I just can’t imagine not having a bookcase filled with every different edition and translation of my husband’s novels, with their colorful and wildly different dust jackets, or the books inscribed to us by our author friends.

There are books you just want to read, and there are books you want to collect. Physical books and ebooks have different advantages and disadvantages, so there shouldn’t be a fight to the death between the two formats– there’s room for both. It would be great to always have both available and be able to choose each book in whichever format would be best for the individual reader. And I can see a lot of situations where I’d want both. I love the beautifully designed and illustrated first edition of Quicksilver that Neal Stephenson inscribed to me and Matt, but it would be great to also have a digital copy of the text to read on vacation, as his books weigh a ton. If I had a Kindle, the first things I’d download would be my favorite public domain works, like Pride and Prejudice and Jane Eyre and the works of Shakespeare, but they wouldn’t replace my physical copies. The real danger to someone like me is that the instant access Kindle gives me to thousands of books, old and new, would be hard to resist.

Why do Jeff Bezos and others who have fully embraced the ebook feel it necessary to dismiss or trash the physical book? Given the state of publishing and the rising influence of digital natives, I can’t help but worry about the future of not only the physical book, but also bookstores and author readings/signings, which help connect authors with readers, and readers with authors.

Categories: Amazon · Authors · Books · Bookstores · Controversy · Cultural treasures · Kerfuffles

“I kind of am grumpy when I am forced to read a physical book…”

June 17, 2009 · 5 Comments

Jeff Bezos, founder and CEO of Amazon.com, on why he no longer likes to read physical books:

I kind of am grumpy when I am forced to read a physical book. Because it’s not as convenient. Turning the pages … I didn’t know this either, until I started using the Kindles a couple months ago, I mean a couple years ago, I didn’t understand all of the failings of a physical book, because I’m inured to them. But you can’t turn the page with one hand. The book is always flopping itself shut at the wrong moment. They’re heavy. You can only take one or two of them with you at a time. It’s had a great 500-year run. [Audience laughter.] It’s an unbelievably successful technology. But it’s time to change.

This quote was transcribed by Carolyn Kellogg in the LA Times’ Jacket Copy blog from a video of Bezos in conversation with Steven Levy at Wired’s Disruptive by Design conference. Tim O’Reilly transcribed a bunch of other notable quotes from the video on his blog. You can watch the entire video for yourself at Jacket Copy or MobyLives or Wired Video.

Categories: Amazon · Books · Controversy · In the news · Kerfuffles · Quotes

Amazon acknowledges its “embarrassing and ham-fisted cataloging error”

April 13, 2009 · 4 Comments

On Monday afternoon, Amazon spokesman Drew Herdener released this statement about the AmazonFail fiasco to the LA Times and other media outlets:

This is an embarrassing and ham-fisted cataloging error for a company that prides itself on offering complete selection.

It has been misreported that the issue was limited to Gay & Lesbian themed titles – in fact, it impacted 57,310 books in a number of broad categories such as Health, Mind & Body, Reproductive & Sexual Medicine, and Erotica. This problem impacted books not just in the United States but globally. It affected not just sales rank but also had the effect of removing the books from Amazon’s main product search.

Many books have now been fixed and we’re in the process of fixing the remainder as quickly as possible, and we intend to implement new measures to make this kind of accident less likely to occur in the future.

Various sources are reporting that rankings have been restored to some of the affected books. I just spot-checked the specific titles I mentioned in my last blog post, and as I write this some have had their ranking restored, but most have not. Let’s see how long it takes to fix them all. Fixing the damage to Amazon’s reputation and restoring the trust and goodwill that’s been lost will take a lot longer and require more than just this PR statement. That Amazon has not handled this well is a monumental understatement. It would help if they issued an actual apology that showed some understanding or acknowledgment of why the Internet and Twitter exploded. It may have been an unintentional error, but the results and implications of that error and Amazon’s late and inadequate response caused pain to authors and readers alike and felt like a betrayal to those of us who’ve been Amazon customers for years. You can fix this, Amazon. Please try. Hard.

I’ll leave you with one last link for now: Kelley Eskridge’s great post on her Humans at Work blog about the management lessons to be learned from Amazonfail.

Update: On Monday night the Seattle P-I posted “AmazonFail: An inside look at what happened“:

I’ve spoken to an Amazon.com employee who works closely with the systems involved in the glitch… On Sunday afternoon at least 20 Amazon.com employees were paged alerting them that items, possibly many, were incorrectly being flagged as adult. The employees also received links to the Twitter discussion AmazonFail. Thousands of people were angry that gay-themed books had disappeared from Amazon’s sales rankings and search algorithms…

By this time, Amazon.com had upgraded the problem to Sev-1. (Amazon.com breaks down its operational issues in terms of severity levels. Sev-3 means a problem affects a single user. Sev-2 is a problem that affects a company, or a lot of people. Sev-1 is reserved for the most critical operational issues and often are sent up the management chain to the senior vice president level.)

“People got pulled away from their Easter thing when this whole thing broke,” the employee said. “It was just a screwup.”

Amazon.com employees are on call 24/7, and many began working on the problem from home. It didn’t take much digging to realize that there was a data error.

Amazon managers found that an employee who happened to work in France had filled out a field incorrectly and more than 50,000 items got flipped over to be flagged as “adult,” the source said. (Technically, the flag for adult content was flipped from ‘false’ to ‘true.’)

“It’s no big policy change, just some field that’s been around forever filled out incorrectly,” the source said.

New Update (April 14): All of the books I listed in my original blog post have had their sales ranking restored.

Categories: Amazon · Authors · Books · Censorship · Controversy · Crimes against literature · In the news

Amazon sets the blogosphere on fire

April 12, 2009 · 2 Comments

Thanks to Cheryl Morgan, I discovered that Amazon.com has done something ridiculously stupid and offensive– it has stripped the sales rankings from many books (fiction, nonfiction, and academic) with gay or lesbian subjects or characters, and it is excluding this “adult” material from some searches and bestseller lists.  People are posting lists of books affected online, and I did some quick author and title searches myself, so here are just a few of the books now missing their rankings:

  • Fiction:  E.M. Forster’s Maurice, D. H. Lawrence’s Lady Chatterley’s Lover, Radclyffe Hall’s The Well of Loneliness, James Baldwin’s Giovanni’s Room, Rita Mae Brown’s Rubyfruit Jungle, Jeanette Winterson’s Oranges are not the Only Fruit, Manuel Puig’s Kiss of the Spider Woman, Annie Proulx’s Brokeback Mountain, Sarah Waters’ Tipping the Velvet, and books by Nicola Griffith, among others.
  • Biographies, autobiographies, and memoirs:  Randy Shilts’ The Mayor of Castro Street: The Life and Times of Harvey Milk, Dan Savage’s The Committment: Love, Sex, Marriage and My Family, Quentin Crisp’s The Naked Civil Servant, and Gerald Clarke’s biography of Truman Capote.
  • History: David Carter’s Stonewall: The Riots that Sparked the Gay Revolution, Vito Russo’s The Celluloid Closet, Louis Crompton’s Homosexuality and Civilization, and Tin’s The Dictionary of Homophobia: A Global History of Gay & Lesbian Experience.

Mark R. Probst raised the alarm in his blog and posted Amazon’s response to his inquiry about this:

On Amazon.com two days ago, mysteriously, the sales rankings disappeared from two newly-released high profile gay romance books: “Transgressions” by Erastes and “False Colors” by Alex Beecroft. Everybody was perplexed. Was it a glitch of some sort? The very next day HUNDREDS of gay and lesbian books simultaneously lost their sales rankings, including my book “The Filly.” There was buzz, What’s going on? Does Amazon have some sort of campaign to suppress the visibility of gay books? Is it just a major glitch in the system? Many of us decided to write to Amazon questioning why our rankings had disappeared. Most received evasive replies from customer service reps not versed in what was happening. As I am a publisher and have an Amazon Advantage account through which I supply Amazon with my books, I had a special way to contact them. 24 hours later I had a response:

In consideration of our entire customer base, we exclude “adult” material from appearing in some searches and best seller lists. Since these lists are generated using sales ranks, adult materials must also be excluded from that feature.

Hence, if you have further questions, kindly write back to us.

Best regards,

Ashlyn D

Member Services

Amazon.com Advantage

The Meta Writer blog has a link roundup and has starting compiling a list of notable books that have had their sales ranking removed. Strangely, the Kindle editions of GLBT books still have their rankings– for now.

News of this is raging across the blogosphere, and there are more blog posts about it than I can link to, but here are a few more from Booksquare and Kelley Eskridge and Nicola Griffith. Amazon should quickly reverse itself or else this is going to grow to a public relations disaster of epic proportions.

Update:  Sunday evening news flash from Publisher’s Weekly:

A groundswell of outrage, concern and confusion sprang up over the weekend, largely via Twitter, in response to what authors and others believed was a decision by Amazon to remove adult titles from its sales ranking. On Sunday evening, however, an Amazon spokesperson said that a glitch had occurred in its sales ranking feature that was in the process of being fixed. The spokesperson added that there was no new adult policy.

Categories: Amazon · Authors · Books · Censorship · Controversy · Crimes against literature · In the news