Category Archives: Searching

I’m teaching a fact-checking workshop in Seattle on November 4th

I’ll be teaching a fact-checking workshop in Seattle on Saturday, November 4th for the Northwest Editors Guild. This will be a practical how-to workshop for anyone interested in accuracy–editors, writers, or readers–and registration is open to all. I hope you can join me! Here are the details:

Don’t Assume Anything: Practical Fact-Checking

Fact-checking is an important and useful skill for editors, writers, and
readers. But how can you tell whether a piece of writing or a source is
accurate, fair, and credible?

Join researcher Lisa Gold for a how-to workshop guiding you through the
steps of fact-checking—reading skeptically, asking questions, deciding what
to check, assessing the accuracy of different types of facts, finding and
evaluating sources, working with authors, and making corrections. Lisa will
answer questions and share tips, examples, and resources. This practical,
hands-on workshop will take us beyond the introduction to fact-checking
that Lisa gave at the November 2016 Northwest Editors Guild
meeting [watch it here: https://lisagoldresearch.wordpress.com/2016/11/15/my-talk-about-fact-checking/].

Join us Saturday, November 4, from 10:00 am to 1:00 pm at the Phinney
Neighborhood Center Community Hall (Brick Building, Lower Level) for this workshop. The cost will be $45 for Editors Guild members, $50 for nonmembers.

Register for the workshop online here: https://edsguild.org/dont-assume-anything-practical-fact-checking/

Speaker Bio: Lisa Gold is a freelance researcher, fact-checker, and writer. She has fact-checked many kinds of writing, including magazine articles, reported features, essays, book reviews, historical novels, and nonfiction books. She does creative research for authors of fiction and nonfiction, and she teaches research workshops. You’ll find Lisa online at www.lisagold.com and on Twitter at @bylisagold.

 

 

Let’s talk about search

So my last post, “Yet another study shows that ‘digital natives’ suck at searching,” seems to have struck a nerve– it’s received over 5000 hits (thanks to links from BoingBoing and Fark, as well as Twitter and Facebook), and I’ve been reading  the wide range of comments that have sprung up in various places (including my blog, BoingBoing, and the original article at Inside Higher Ed).

I think what many people (especially students) don’t understand is that search is both a tool and a process, requiring different skills, knowledge, and experience. You can learn just enough to get by or really master it with a little curiosity, persistence, time, and practice. There are many ways to do this, and you don’t need a formal class– you can teach yourself (as I did).  There are lots of online resources to help you, including tutorials and how-to guides on university and library websites and specific search engine help pages. (And don’t forget about librarians, a seriously underused resource.) There are links to some resources in my posts and my blogroll and I’ll add more soon.

I do believe it’s important for students to be taught (and regularly practice)– at school, in libraries, and at home– the essentials of digital and information literacy and critical thinking, starting at a young age  and continuing throughout their education. These are important life skills which are being sadly neglected.

Yes, it’s a truth universally acknowledged that most students are lazy and want to get quick and “good enough” results. But the problem is that they don’t know what they don’t know. (As the ERIAL researchers noted, “students were just as unaware of the extent of their own information illiteracy as everyone else.”) They have no idea that there’s a world of information out there that you can’t find through a Google search. Most of it has never been digitized and probably never will be (for lack of funding and copyright concerns, among other reasons). Some has been digitized but is locked in proprietary databases and the “invisible web.” Most books and articles published in the US after 1922 are still under copyright, so even if they’ve been digitized chances are they aren’t free (unless you borrow them from a library). Even if information has been indexed in Google, you may never find it if you don’t know how to properly search for it.

Google could certainly improve the situation, but it is a company of engineers trying to make search as easy and simple as possible for the vast majority of users, giving them a single “magic box” into which they can type anything and get results, even if they’ve spelled the keywords wrong or don’t really know what they are looking for. Some of the “improvements” they’ve made over time have made it frustrating for advanced users like me, such as ignoring the terms I’ve actually typed and substituting what they assume I’m looking for, or filtering my results based on my past search history. And if you want more advanced search options, Google doesn’t make it easy to find or learn about them, and their help articles often aren’t helpful at all. Search is not just an engineering problem to be solved– it is both an art and a science, and there is no one-size-fits-all solution. But no matter how good or flawed a tool like Google search is, anyone can learn how to use it well and get far better results.

A number of people have asked for some advice and tips on search, so here you go.

General advice:

  • When using any search engine, database, or website with search functions, take a few moments to read the instructions or help pages to figure out how to use the site to its full advantage.
  • Search engines are constantly evolving, so you should periodically review the instructions to see if you need to make changes in the way you search.
  • Every search engine is different, so what works for one won’t necessarily work for all, and each may produce different results using the same search terms.
  • If there is an “advanced search” option, you should always use it, as it will give you much more control over your searching and the results.
  • Refine your search. Experiment with different keywords and combinations of keywords. Look for clues to other possible keywords, such as related terms, alternate names, and subject-specific terminology. If you don’t get the results you are looking for, keep trying different things.
  • Remember to look beyond just the first few search results.

Some Google-specific tips:

  • Google’s search tips and help articles can be hard to find, but they do have useful information. Here are direct links:
  • Use Google’s advanced search function, which allows you to limit your search  in many different ways and combinations (all these words, this exact wording or phrase, one or more of these words, don’t show pages that have any of these unwanted words, language, file type, search within a site or domain, etc.). There is no longer a link to it on the main search page– it’s now hidden behind the gear icon (search settings) in the upper right corner. Here’s the direct link:  http://www.google.com/advanced_search?hl=en
  • If you’d rather use Google’s main search box instead of the advanced search, the help articles I linked to above have command shortcuts you can use, such as placing quotation marks around exact phrases. Note that Google no longer uses all Boolean operators. (You don’t need AND as it is the default in all searches. You can still use OR. Don’t use NOT, instead place a minus sign (-) directly before any words or terms you want to exclude.)
  • Google has many specialized search functions for images, news, blogs, scholarly papers, books, patents, etc.  Look in the upper left-hand corner, click “more,” then click “even more” for a full list. Here’s the direct link to the list:  http://www.google.com/intl/en/about/products/index.html
  • All the words you put in the query will be used and the order you put them in matters.
  • Search is case insensitive, punctuation is usually ignored, and common words (the, a) are usually ignored.
  • Google automatically searches for common variations of a keyword.

A final note: Improving your search skills is important, but it’s even more important that you think critically and evaluate your search results and sources.  (See some of my previous posts for more about this.)

Yet another study shows that “digital natives” suck at searching

I’ve blogged before about studies showing that so-called “digital natives” lack basic information literacy skills and have great difficulty doing academic research and finding and evaluating sources.  (My two posts on Project Information Literacy studies are here and here.)

This Inside Higher Ed article reported today on the results of new studies by the ERIAL (Ethnographic Research in Illinois Academic Libraries) Project. Here’s an excerpt, but you should read the whole thing:

“The majority of students — of all levels — exhibited significant difficulties that ranged across nearly every aspect of the search process,” according to researchers there. They tended to overuse Google and misuse scholarly databases. They preferred simple database searches to other methods of discovery, but generally exhibited “a lack of understanding of search logic” that often foiled their attempts to find good sources….

The most alarming finding in the ERIAL studies was perhaps the most predictable: when it comes to finding and evaluating sources in the Internet age, students are downright lousy….

The prevalence of Google in student research is well-documented, but the Illinois researchers found something they did not expect: students were not very good at using Google. They were basically clueless about the logic underlying how the search engine organizes and displays its results. Consequently, the students did not know how to build a search that would return good sources. (For instance, limiting a search to news articles, or querying specific databases such as Google Book Search or Google Scholar.)

Duke and Asher said they were surprised by “the extent to which students appeared to lack even some of the most basic information literacy skills that we assumed they would have mastered in high school.” Even students who were high achievers in high school suffered from these deficiencies…

In other words: Today’s college students might have grown up with the language of the information age, but they do not necessarily know the grammar.

“I think it really exploded this myth of the ‘digital native,’ ” Asher said. “Just because you’ve grown up searching things in Google doesn’t mean you know how to use Google as a good research tool.”

Even when students turned to more scholarly resources, that did not necessarily solve the problem. Many seemed confused about where in the constellation of library databases they should turn to locate sources for their particular research topic: Half wound up using databases a librarian “would most likely never recommend for their topic.”…

Years of conditioning on Google had not endowed the Illinois Wesleyan students with any searching savvy to speak of, but rather had instilled them with a stunted understanding of how to finely tune a search in order to home in on usable sources, concluded the ERIAL researchers.

Regardless of the advanced-search capabilities of the database they were querying, “Students generally treated all search boxes as the equivalent of a Google search box, and searched ‘Google-style,’ using the ‘any word anywhere’ keyword as a default,” they wrote. Out of the 30 students Duke and Asher observed doing research, 27 failed to narrow their search criteria at all when doing so would have turned up more helpful returns.

Unsurprisingly, students using this method got either too many search results or too few. Frequently, students would be so discouraged they would change their research topic to something more amenable to a simple search….

Duke and Asher noted: “Students showed an almost complete lack of interest in seeking assistance from librarians during the search process.” Of all the students they observed — many of whom struggled to find good sources, to the point of despair — not one asked a librarian for help.

In a separate study of students…, other ERIAL researchers deduced several possible reasons for this. The most basic was that students were just as unaware of the extent of their own information illiteracy as everyone else.

How are students supposed to acquire these important digital and information literacy skills if they aren’t being taught in schools, many parents and teachers lack these skills themselves, and the librarians who have the skills are ignored or fired as libraries close in record numbers?

Productive frustration, lost libraries, and required reading for aspiring writers

As I read  Ben Greenman’s “Lives” essay in today’s New York Times Magazine, this caught my eye:

By supplying answers to questions with such ruthless efficiency, the Internet cuts off the supply of an even more valuable commodity: productive frustration. Education, at least as I remember it, isn’t only, or even primarily, about creating children who are proficient with information. It’s about filling them with questions that ripen, via deferral, into genuine interests.

I like the term “productive frustration” because it is an apt description of part of the research process. Finding quick answers to simple questions is easy, but it shuts down the process. When you wrestle with more complex questions, try to figure out what you need to know and the different ways you might look for it, or can’t find what you’re searching for, it’s an opportunity to think more creatively, explore new paths, feed your curiosity, and make unexpected discoveries. As the Gaga librarians sang, “when it comes to search if it’s not tough it isn’t fun…”

Here are links to some other excellent pieces I read this week:

Craig Fehran’s “Lost Libraries: The strange afterlife of authors’ book collections” describes what happened to the personal libraries of David Foster Wallace, David Markson, John Updike, Mark Twain, and other authors after their deaths:

Most people might imagine that authors’ libraries matter–that scholars and readers should care what books authors read, what they thought about them, what they scribbled in the margins. But far more libraries get dispersed than saved. In fact, David Markson can now take his place in a long and distinguished line of writers whose personal libraries were quickly, casually broken down….

The issues at stake when libraries vanish are bigger than any one author and his books. An author’s library offers unique access to a mind at work, and their treatment provides a look at what exactly the literary world decides to value in an author’s life….

Charles Stross’ post “Why Wikipedia is the writer’s friend” explains why fiction writers should do at least enough research to avoid errors and prevent the collapse of suspension of disbelief. Though I certainly don’t recommend using Wikipedia as your source for research and fact-checking, I strongly agree with his larger point: 

Always do, at a minimum, a brief fact-check on everything you write in a work of fiction with a non-fantastical/far future setting. Otherwise you will be sorry….

Fiction relies upon the reader’s willing suspension of disbelief — if you’re immersed in a novel, it helps not to be jerked up short every ten seconds by the realization that the setup is nonsensical….

John Scalzi’s “Writing: Find the Time or Don’t.” This post should be required reading for all aspiring writers:

So: Do you want to write or don’t you? If your answer is “yes, but,” then here’s a small editing tip: what you’re doing is using six letters and two words to say “no.” And that’s fine. Just don’t kid yourself as to what “yes, but” means.

If your answer is “yes,” then the question is simply when and how you find the time to do it. If you spend your free time after work watching TV, turn off the TV and write. If you prefer to spend time with your family when you get home, write a bit after the kids are in bed and before you turn in yourself. If your work makes you too tired to think straight when you get home, wake up early and write a little in the morning before you head off….

And if you can’t manage that, then what you’re saying is that you were lying when you said your answer is “yes.” Because if you really wanted to write, you would find a way to make the time, and you would find a way to actually write….

But if you want to be a writer, than be a writer, for god’s sake…. Find the time or make the time. Sit down, shut up and put your words together. Work at it and keep working at it. And if you need inspiration, think of yourself on your deathbed saying “well, at least I watched a lot of TV.” If saying such a thing as your life ebbs away fills you with existential horror, well, then. I think you know what to do.

Intute–one of the best link collections for research–has lost its funding

I learned today (via Resource Shelf) that Intute, one of my favorite research websites because of its great annotated reference link collection, will lose its funding as of August 2010. From the announcement on Intute’s blog:

We regret to inform our users and contributors that JISC has announced that its funding for Intute will be cut with effect from August 2010. It is JISC policy that, wherever possible, services move from being fully funded by JISC to being sustainable by other means. Unfortunately in the current economic climate no realistic alternative funding model for Intute as it currently stands has been identified.

Despite this JISC has acknowledged the pioneering work of Intute, its value to the community, and the insights it has given into the use of the Internet in education.

Our current service level will be maintained until 1 August 2010. After this date, Intute will still be available but with minimal maintenance….

Intute was created by a group of UK universities as a free online service to help find the best web resources for education and research. The site contains an excellent (and very large) annotated collection of links to web resources that you can search or browse by subject. (Subject specialists select and evaluate the sources and write detailed descriptions.) Intute also provides Internet training with free resources and online tutorials to help develop Internet research skills.

Intute is funded by JISC, the Joint Information Systems Committee, which, according to its website, “inspires UK colleges and universities in the innovative use of digital technologies, helping to maintain the UK’s position as a global leader in education.” Here’s the link to JISC’s statement about cutting off funding for Intute. It sounds like they may keep the site up, but if the database of web sources is not regularly updated and expanded, over time it will become outdated and far less useful.

Sites like Intute are crucially important, as their experts curate the web by finding, evaluating, and highlighting the most useful and credible sources, more of which come online every day. I wonder if there’s any chance of some institution stepping up to provide funding or take over the project.

Update, January 2015: Intute has disappeared from the web.

More from Project Information Literacy: “the librarian approach is based on thoroughness, while the student approach is based on efficiency”

Back in April, I wrote a long post titled “I’m shocked to discover there’s gambling in this casino…” about Project Information Literacy’s February 2009 report, “Finding Context: What Today’s College Students Say About Conducting Research in the Digital Age.” In my post I was somewhat snarky about the report, as you can see from this excerpt:

Surely it isn’t a surprise that:

* Students always procrastinate and are looking for quick and easy answers.

* Students spend an unlimited amount of time and effort on things that interest them, but do the minimum necessary on academic assignments.

* Students universally use the two tools that are the most convenient, familiar, and useful to them– Wikipedia and Google– and they will continue to do so regardless of what their professors say.

* Students don’t know or haven’t been taught how to do research, think critically, find and evaluate sources (online and in the library), and efficiently sift through the overwhelming amount of available information to find what they need.

This month Project Information Literacy released a new and more in-depth report, “Lessons Learned: How College Students Seek Information in the Digital Age”, and it offers some valuable insights and recommendations. Among the report’s conclusions:

When it came to everyday life research, nearly all of the respondents used Google, Wikipedia, and friends for finding context. Almost all of the students used course readings, [online] library resources, and public Internet sites such as Google and Wikipedia, when conducting course-related research—no matter… what resources they had at their disposal.

The relatively consistent pattern of information usage suggests that most students in our study favored a risk-averse and predictable information-seeking strategy. The student approach appears to be learned by rote and reliant on using a small set of resources nearly each and every time.

At the same time, the student approach may sometimes backfire. Using public sites on the Internet, such as Google search, early on, may be one reason why students reportedly find research frustrating in the digital age.

We have found studentsʼ frustrations and challenges involve narrowing down topics, finding relevant resources, sorting through too many results from online searches, and evaluating the credibility of what students choose to use. Still, almost all students used public Internet sites early on, despite their known limitations….

A significant majority of students in our sample–8 in 10–did not ever consult librarians for course-related research assignments. Instead, instructors played an important role in coaching students through the research process…

When it comes to finding information and conducting research, today’s students clearly favor brevity, consensus, and currency in the information sources they seek… [They] have defined their preferences for information sources in a world where credibility, veracity, and intellectual authority are less of a given–or even an expectation from students–with each passing day.

All in all, we are reminded of a comment from one student… about using books from the campus library: “Books, do I use them? Not really, they are antiquated interfaces. You have to look in an index, way in the back, and it’s not even hypertexted linked.”

Today’s students are not lazy or unthinking. This student, representing many, looks at information sources, systems, and services as to how well they meet his or her needs in terms of content, accessibility, and usefulness….

So students prefer to use web sources like Google and Wikipedia because they are fast, convenient, familiar, and produce results, meeting their needs for “content, accessibility, and usefulness.” Of all of the library resources provided to students, online scholarly research databases are used the most, as not only do instructors require their use to find credible content, but they are easy to search. Students aren’t using resources like books, even when they are better and more authoritative for academic research, because they take more time, thought, and effort to find and use, and they can’t be quickly and easily searched. This makes sense– today’s college students are digital natives.

I didn’t understand why so few students use librarians as a resource until I read about the “critical difference between the students’ approach and the librarians’ approach” to research:

“The library guide recommends beginning course-related research by using library resources to identify and narrow down a topic. These resources, the library catalog and periodical indices, are all vetted, credible, and authoritative. Only much later in the research process, and only after a topic has been safely nailed down, does the guide recommends turning to Internet resources, such as Google… The student approach is different… [They] reported using public Internet sources (i.e. Google and Wikipedia) in their initial stages of research for a variety of reasons, which included a belief that the Internet is an all-inclusive information resource… All in all, the librarian approach is based on thoroughness, while the student approach is based on efficiency. To that end, librarians suggest using scholarly resources, while many students in our study used a wide range of resources that deliver an abundance of results early on, whether they are scholarly or not. As a whole, the findings suggest that students in our sample favored sources for their brevity, consensus, and currency over other qualities and less so, for their scholarly authority.

At the end of the report, the authors make a series of recommendations, of which I thought these were particularly important:

Course-related research assignments should not indirectly encourage students to half-heartedly engage in a narrow exploration of the digital landscape (e.g., assignments that state requirements such as, “must use five sources cited in your paper”). Administrators, faculty, and librarians should examine whether research-based assignments result in opening studentsʼ minds to expand their information-gathering competencies. Instead, we recommend that students be given course-related research assignments that encourage the collection, analysis, and synthesis of multiple viewpoints from a variety of sources, so the transfer of information literacy and critical thinking competencies may be more actively called up, practiced, and learned by students…

Our work leads us to draw an important distinction between library services and library resources… For the most part, in our study, librarians were left out of the student research workflow, despite librariansʼ vast training and expertise in finding information. Librarians should systematically (not just anecdotally) examine the services they provide to students… Questions should be addressed about how and why services and resources are used—not only how often (e.g., circulation or reference desk statistics). Librarians may want to initiate their analysis by asking what percentage of their campus are using the library, for what particular resources or services, and why or why not?

So what do you think? How can we expand the minds and research methods of digital natives? We can’t convert all information to digital form, so are there ways to pry them away from their computers and into the stacks? Should we even try? Rather than trying to change the ways they do research, should we instead focus on teaching them to improve their web search skills and find and evaluate digital sources? Can we provide better or more authoritative alternatives to Wikipedia and Google, or make it easier to find academic sources with one search? How can we make academic research more interesting and creative for students?

I welcome your comments and ideas.

Time-traveling through the English language with the Historical Thesaurus of the Oxford English Dictionary

Matt and I recently had the opportunity to spend some time browsing through the Historical Thesaurus of the Oxford English Dictionary, which has just been published by Oxford University Press.

I began by reading the introduction. He began by looking up curse words. Once he had satisfied his curiosity about when certain very popular profanities first entered the English language, he turned his attention to the more unusual words within the inferior persons, as abused subcategory that have fallen out of use, such as windfucker (1602 to 1616), hog-rubber (1614 to 1621), chuff-cat (1653), shit-sack (1769 to 1785), and son of a sea-cook (1806 to 1977). This led to an animated discussion of the common themes that connected many of the words—comparisons to animals, sex with relatives or objects, and the inability to control one’s bowels.

That’s what happens when you put the Historical Thesaurus of the Oxford English Dictionary within reach of a writer.

Browsing this work feels strangely like time-travel. All the words from Old English to 2003—obsolete and current, including slang and dialect—have been extracted from the Oxford English Dictionary and organized by their meanings and dates of use. This places each word within its historical context, revealing how ideas and meanings emerged and the different ways they’ve been expressed through time.

It took forty-four years to bring the HTOED to publication, overcoming what the editors politely describe as “a series of intellectual, financial, and domestic challenges.” About 800,000 meanings from the OED were transcribed onto slips of paper and organized into a unique classification system with over 236,000 categories and subcategories. A fire in 1978 would have destroyed a decade of work but for the fact that the paper slips were stored in a metal filing cabinet. They could have finished making slips by 1980, but the decision was made to add new material from the second edition of the OED and the supplements. Computers were eventually used to enter, store, and retrieve data, but much of the work continued to be done by hand.

The result is the world’s largest thesaurus, nearly 4000 pages of small type in two big volumes weighing fifteen pounds, with a slipcase and folding chart of the top levels of the classification system. I like print references because browsing can lead to serendipitous discoveries, but these books can be awkward to use. It’s especially frustrating when looking up a word with multiple meanings, as the index may list dozens of identification numbers, which means lots of page flipping. No, it’s not available online or on CD, though that may eventually change. I’d like to see the powers-that-be at Oxford University Press quickly add the HTOED to the online OED so both works can be used together and fully cross-referenced and searched.

The classification system of the HTOED is mind-bogglingly complex, forming a hierarchy of meaning from the general to the specific. At the highest level are the three main sections—the external world, the mental world, and the social world—which divide into 26 major categories, such as the earth, life, emotion, society, morality, faith, armed hostility, and communication. These branch into more detailed categories like food, clothing, people, animals, transport, love, moral evil, and sexual relations. More specific categories and subcategories lead to the synonyms and related words, which are organized by part of speech and listed chronologically with the date of the first recorded use in English and, for obsolete words, the last recorded use. (I recommend reading the “guide to the use of the thesaurus” to get your bearings.)

Each level in this hierarchy of meaning is assigned a two-digit number, which when combined creates identification numbers for every word in the thesaurus. Some words have many identification numbers because they have numerous meanings or have changed their meanings over time and thus appear in different locations within the thesaurus.

For example, in the alphabetical index, the first identification number for the noun serendipity, one of my favorite words, is 01.05.05.10.02.01|10.01, locating it in the thesaurus within these nested categories and subcategories:

01                                                         the external world
01.05                                                  existence in time and space
01.05.05                                           action/operation
01.05.05.10                                    endeavour
01.05.05.10.02                             searching/seeking
01.05.05.10.02.01 (n.)              finding/discovery (noun)
01.05.05.10.02.01|10               accidentally (subcategory)
01.05.05.10.02.01|10.01        faculty of making happy discoveries by chance

Here you’ll find that the noun serendipity was first cited in 1754. After the finding/discovery (noun) category is the finding/discovery (adjective) category, in which serendipitous (01.05.05.10.02.01|03) dates from 1958.

The HTOED will clearly be important to the study of the English language, but it also could contribute to other subjects, especially history, literature, and culture. The descriptions of life and the earth over centuries are like crash courses in the history of science and medicine. Cultural historians will look for clues in our language to our attitudes about gender, race, and class, as with the words used to describe women based on animals (mare, hen, cow, heifer, bird) or clothing (skirt, smock, petticoat). Advancements in technology are reflected in subjects like travel, tools, telecommunications, and computing. Shakespeare scholars will be able to compare the words in use during his lifetime and argue about the reasons for his word choices. Even a category like clothing can reveal shifts in morality, as when underwear became unmentionables in 1823.

I believe the HTOED could be a rich source of inspiration and world-building for writers. Historical novelists could gain insight into the past and how people lived, what they knew and believed, and how they described their own world. And they’ll know whether the words their characters are speaking were actually in use at the time. (Elizabethans would not have called a packed meal a picnic, as it was first cited in 1748.)  Fantasy writers may unearth ideas in forgotten names or descriptions of supernatural beings and mythical creatures. Poets can reintroduce lyrical and imaginative words that have fallen out of use, such as candel (Old English to 1634), luminair (1456 to 1560), or streamer (1513 to 1647), all of which once described heavenly bodies. Eclectic writers like my husband who have a strong love of word-play and enjoy collecting unusual bits of knowledge will find it addictive.

Let’s say you’d like to take advantage of the current craze for vampires or literary monster mash-ups like Pride and Prejudice and Zombies. The HTOED can tell you when different monsters first entered our nightmares and what we called them at distinct points in time. Follow the hierarchy of categories from the external world to the supernatural to supernatural being/spirit to malignant monster (noun). Here you’ll find that the word vampyre was first cited in 1734, followed by vampire in 1796. Though vampire is still in use today, the last recorded OED citation for vampyre was in 1847. Referring to vampires as undead didn’t begin until 1897. Werewolves trace all the way back to the Old English werewulf, lycanthrope was first cited in 1813 and is still in use, but the more poetic turnskin entered the language in 1831 and exited forty years later. Oh, and zombie was first cited in 1819, two years after the death of Jane Austen.

The editors have included all those words that have been too controversial for some other dictionaries and thesauruses. Curse words, sexual slang, and offensive slurs for racial and sexual minorities appear dispassionately in their chronological place among their less inflammatory cousins. Reading certain entries may cause shock, disgust, or pain, but there is value in putting these powerful words in their historical context. If you are easily offended or prefer your works expurgated, consider yourself warned.

Priced at $395 (on sale at Amazon for $316), the HTOED will unfortunately be out of reach for many of the writers and word lovers who might appreciate it, so keep it mind if you are looking for a fabulous gift for your favorite logophile.

For more information, check out this OUP website for the HTOED and this OUP blog post with “fun facts and figures” about the work. Here’s the link to a sample page from the work at the OUP website.

UPDATE, 10/28/09: I received an email from Christian Kay, editor of the HTOED. There are indeed plans to eventually link the HTOED to the OED online and make it available to subscribers, but that could be a couple of years away. There are no plans for a CD version. So it looks like the books will be the only option for quite some time.

NEW UPDATE, 6/18/10: The Oxford English Dictionary Online will be relaunched in December 2010 and will include an integrated online edition of the Historical Thesaurus. See my blog post “Word lovers rejoice” for more information.

NEW UPDATE, 11/30/10: The new OED website has launched, fully integrating the online Oxford English Dictionary with the Historical Thesaurus. See my new blog post for more information.

In praise of browsing

For me, an important element of creative research is serendipity, which the OED defines as “making happy and unexpected discoveries by accident.” Browsing is a great way to open yourself to serendipity, but it is unfortunately becoming a lost art in this digital age.

Browsing and searching are different– browsing is about the journey, searching is about the destination. Searching is focused on finding specific information quickly and often leads to tunnel-vision, which can prevent you from recognizing useful sources that don’t match your preconceived ideas and assumptions. Browsing is about slowing down, opening your eyes, feeding your curiosity, and allowing yourself the opportunity to make discoveries.

I believe it’s important to set aside time to browse on a regular basis– not just on the web, but in the physical world as well. Spend time exploring different bookstores (both new and used), visit libraries and museums, and search out unusual places you’ve never visited. Take a different route, walk around neighborhoods you don’t live in, look for hidden treasures.

Whenever you are looking for something in a bookstore or library, always browse the surrounding books and nearby shelves. I can’t tell you the number of times that I found books much better than the one I was looking for by doing this. Sometimes you don’t really know what you need until you find it.

Remind yourself to occasionally browse unfamiliar sections or subjects in bookstores and libraries, rather than only the ones you think will be of interest to you. Bookstores (especially those selling used books) each have their own idiosyncratic system of categorizing, so what you want may be in a section you never visit, and if you change your routine you might stumble across amazing things you aren’t looking for.

Use sources to lead you to other sources. Whenever you are looking at a book or article, browse the bibliographies or lists of references cited, as this will often reveal useful sources you might not have found on your own. When you discover an interesting blog or website, check out the list of links and bookmark those that may be useful to you. Talk to people and ask them for recommendations.

If something arouses your curiosity or inspires you, embrace the creative impulse and and see where it leads. Write stuff down. Let your mind wander.

I hope you’ll all spend some time browsing this holiday weekend. You never know what you might discover….

I’m shocked to discover there’s gambling in this casino…

I recently read the February 2009 Project Information Literacy Progress Report from the Information School at the University of Washington, titled “Finding Context: What Today’s College Students Say about Conducting Research in the Digital Age.”

The goal of the project is to “understand how early adults conceptualize and operationalize research activities for course work and ‘everyday use’ and especially how they resolve issues of credibility, authority, relevance, and currency in the digital age.”

While I admire these goals and am obviously interested in the subject, I wasn’t surprised by the findings, and I found some of the quotes from students unintentionally funny.

The report begins by asking:

What is it like to be a college student in the digital age? In a world teeming with information technology and overflowing with access to data, how do students find the information they need? How do students conduct research for course-related assignments? How do they conduct research for use in their everyday lives? What frustrations and obstacles do they encounter? What strategies have students developed to meet their information needs?

Their conclusion?

So far, we have found that no matter where students are enrolled, no matter what information resources they may have at their disposal, and no matter how much time they have, the abundance of information technology and the proliferation of digital information resources make conducting research uniquely paradoxical: Research seems to be far more difficult to conduct in the digital age than it did in previous times… In general, students reported being challenged, confused, and frustrated by the research process, despite the convenience, relative ease, or ubiquity of the Internet. In our sessions, frustrations included the effects of information overload and being inundated with resources, but more. Participants also reported having particular difficulty traversing a vast and every-changing information landscape. Specifically, participants greatest challenges were related to finding the materials they desired, knew existed, and needed on a “just in time” basis.

Some specific findings:

  • Students treat everyday life research and course-related research differently, and they put a lot more time and effort into the former. “Students reported that searches for everyday life information could last for days, and were driven by curiosity, as students clicked on Google results or Wikipedia citations and unfolded layers of information.”
  • Students “reported almost twice as many frustrations, overall, with conducting course-related research than with everday life research, though the nature or type of participants’ frustrations had underlying similarities.” Everyday life frustrations included: too many results from a Google search and the need to sort through them; knowing the “answer” is online but not being able to find it; figuring out whether a source is credible; knowing that everything is not online; and never can find enough information on the obscure topic being searched. Course-related frustrations included: information overload; too much irrelevant information, can’t locate what is needed from online results; trying to find the “perfect” source; not knowing what to look for, yet still sifting through articles that might fit; trouble finding books needed on library shelves; and conducting research to meet another’s expectations.
  • Students begin their course-related research with Wikipedia. They begin their everyday life research with Google, followed by blogs and Wikipedia.
  • “The majority of the students we interviewed did not start on an assignment–thinking about it, researching, or writing–until two or three days before it was due.”

The study authors note that for students, conducting research “may feel a lot like being an inexperienced sailor heading directly into an oncoming wind, sails wildly flapping, and not being able to maneuver and get to a desired destination.” They believe this is because students are struggling to “find context,” and they go on to describe four types of context students seek: big picture; language, situational, information gathering.

But what do students do to find context? They use Wikipedia, despite its flaws and warnings from their professors. “We found the majority of students ignored the negatives and went to the site anyway. Most students depended on and used Wikipedia for information cited in papers, but just never included Wikipedia entries on their Works Cited page. In our sessions, students also discussed concerns over Wikipedia and accuracy. However, most participants believed that they, themselves, had the ability to discern the credibility of a Wikipedia source, based on their ‘gut level’ interpretation of Wikipedia’s rating system.”

Surely it isn’t a surprise that:

  • Students always procrastinate and are looking for quick and easy answers.
  • Students spend an unlimited amount of time and effort on things that interest them, but do the minimum necessary on academic assignments.
  • Students universally use the two tools that are the most convenient, familiar, and useful to them– Wikipedia and Google– and they will continue to do so regardless of what their professors say.
  • Students don’t know or haven’t been taught how to do research, think critically, find and evaluate sources (online and in the library), and efficiently sift through the overwhelming amount of available information to find what they need.

I wrote in my very first blog post, “Research is like treasure hunting, and to do it well you must be skeptical, curious, discriminating, persistent, and willing to look beneath the surface.” Research also requires time and patience, but it is a skill that can be learned with a little knowledge and practice. However, research is much easier to do when motivated by a love of learning, intellectual curiosity, and passion for your subject, rather than obligation.  Even I don’t like researching subjects I find boring. So while we clearly need to teach students how to do research, we should also be giving them assignments that not only educate them, but also interest and inspire them.

A fascinating post last month in the blog In the Library with the Lead Pipe gave a librarian’s perspective on this, discussing some of the “questionable assignments” given to students by well-meaning faculty members:

Every semester there is at least one assignment that comes across my reference desk that makes me throw my hands up in exasperation (such as: a scavenger hunt that was written before we moved much of our content online or the requirement that the student must have at least one print source, library databases and ebooks do not count)…

We kept getting students who had the same (admirable) weekly assignment: find and read a newspaper article covering the event they were studying that week. The article (or possibly other primary source document) had to have been written during the time of the event and from the perspective of the people involved. We had been doing fine helping them find historical and foreign papers as needed, until they came to the Ottoman Empire. And it didn’t stop there. The class was a survey of world history. They continued to have topics that simply might not have ever been documented by the people involved, unlikely in newspaper article form, certainly not in English, and may not have ever been translated into English if it did manage to get written down and preserved. African events were also particularly difficult…

Scavenger hunt assignments are frustrating for everyone. Looking up trivia is not the same as conducting research and without a meaningful application of the process of using the library anything they learn through the scavenger hunt is less likely to stick…

Often the student, the faculty, or both don’t differentiate between the free web and resources that the library has purchased, but are available electronically…

Librarians are on the front lines of this battle, and both students and faculty would benefit from using them more as a resource.

It looks like I now have lots of subjects for future blog posts…

Great news about Google Book Search

Like everyone else, I use Google dozens of times a day. Their web search engine is still the best I’ve found, but Google also has a number of more specialized search functions that I like and use regularly. By far my favorite is Google Book Search, which is not one of the main functions listed on Google’s home page, but it will appear if you click the “more” link.

Google Book Search enables you to search the full texts of all of the books stored in its database, which is made up of books scanned from the collections of cooperating libraries (including Harvard University, Oxford University, and the New York Public Library, to name a few) and digital book files submitted by publishers. For books in the public domain (published in the U.S. before 1923), you can read as much of the text as you like online, download a PDF file of the entire book, and print as many pages as you want. For books still under copyright, if the author or publisher has given permission, you may be able to view a limited number of pages (though you cannot print or copy any of the text), but most books are restricted to only a few lines or no preview at all. Google displays detailed information about each book, and you can see a list of libraries that have it or buy a copy from an online bookstore.

I love Google Book Search because it helps me discover extremely useful and interesting books of all kinds– old and new, in-print and out-of-print, primary and secondary sources, and valuable sources of information long neglected or forgotten. I love having instant access to complete works in the public domain that are out of print and not available through my local libraries. I love being able to search the texts of millions of books simultaneously for words or phrases– names, places, dates, subjects, titles, historical events, etc.

Google Book Search has always been controversial because it scans books still under copyright without obtaining permission from the authors and publishers, and Google has been sued by the Authors Guild and the Association of American Publishers over it. The great news is, according to Publishers Weekly, the lawsuits have been settled, and everyone wins– millions of books under copyright will be searchable online, there will be a way to purchase full online access to many copyrighted works, the full texts of out-of-print books will be viewable for free on library computer terminals, authors and publishers will control whether or not their works are included and share in the revenue generated through online access to their works, a nonprofit Book Rights Registry will be set up, etc. As both a researcher and the wife of an author, I am thrilled.

Here are excerpts from the AAP statement detailing the settlement:

The Authors Guild, the Association of American Publishers, and Google today announced a groundbreaking settlement agreement on behalf of a broad class of authors and publishers worldwide that would expand online access to millions of in-copyright books and other written materials in the U.S. from the collections of a number of major U.S. libraries participating in Google Book Search….

The agreement promises to benefit readers and researchers, and enhance the ability of authors and publishers to distribute their content in digital form, by significantly expanding online access to works through Google Book Search, an ambitious effort to make millions of books searchable via the Web. The agreement acknowledges the rights and interests of copyright owners, provides an efficient means for them to control how their intellectual property is accessed online and enables them to receive compensation for online access to their works.

If approved by the court, the agreement would provide:

  • More Access to Out-of-Print Books — Generating greater exposure for millions of in-copyright works, including hard-to-find out-of-print books, by enabling readers in the U.S. to search these works and preview them online;
  • Additional Ways to Purchase Copyrighted Books — Building off publishers’ and authors’ current efforts and further expanding the electronic market for copyrighted books in the U.S., by offering users the ability to purchase online access to many in-copyright books;
  • Institutional Subscriptions to Millions of Books Online — Offering a means for U.S. colleges, universities and other organizations to obtain subscriptions for online access to collections from some of the world’s most renowned libraries;
  • Free Access From U.S. Libraries — Providing free, full-text, online viewing of millions of out-of-print books at designated computers in U.S. public and university libraries; and
  • Compensation to Authors and Publishers and Control Over Access to Their Works — Distributing payments earned from online access provided by Google and, prospectively, from similar programs that may be established by other providers, through a newly created independent, not-for-profit Book Rights Registry that will also locate rightsholders, collect and maintain accurate rightsholder information, and provide a way for rightsholders to request inclusion in or exclusion from the project.

Under the agreement, Google will make payments totaling $125 million. The money will be used to establish the Book Rights Registry, to resolve existing claims by authors and publishers and to cover legal fees. The settlement agreement resolves… lawsuits [that] challenged Google’s plan to digitize, search and show snippets of in-copyright books and to share digital copies with libraries without the explicit permission of the copyright owner.

Holders worldwide of U.S. copyrights can register their works with the Book Rights Registry and receive compensation from institutional subscriptions, book sales, ad revenues and other possible revenue models, as well as a cash payment if their works have already been digitized.

Libraries at the Universities of California, Michigan, Wisconsin, and Stanford have provided input into the settlement and expect to participate in the project, including by making their collections available. Along with a number of other U.S. libraries that currently work with Google, their significant efforts to preserve, maintain and provide access to books have played a critical role in achieving this agreement and, through their anticipated participation, they are furthering such efforts while making books even more accessible to students, researchers and readers in the U.S. It is expected that additional libraries in the U.S. will participate in this project in the future….

“It’s hard work writing a book, and even harder work getting paid for it,” said Roy Blount Jr., President of the Authors Guild. “As a reader and researcher, I’ll be delighted to stop by my local library to browse the stacks of some of the world’s great libraries. As an author, well, we appreciate payment when people use our work. This deal makes good sense.”

“This historic settlement is a win for everyone,” said Richard Sarnoff, Chairman of the Association of American Publishers. “From our perspective, the agreement creates an innovative framework for the use of copyrighted material in a rapidly digitizing world, serves readers by enabling broader access to a huge trove of hard-to-find books, and benefits the publishing community by establishing an attractive commercial model that offers both control and choice to the rightsholder.”

“Google’s mission is to organize the world’s information and make it universally accessible and useful. Today, together with the authors, publishers, and libraries, we have been able to make a great leap in this endeavor,” said Sergey Brin, co-founder & president of technology at Google. “While this agreement is a real win-win for all of us, the real victors are all the readers. The tremendous wealth of knowledge that lies within the books of the world will now be at their fingertips.”

Updates:

Here is Google’s blog post about the settlement.

Here the future changes to Google Book Search resulting from the agreement are explained.