TheDailyJournal

 

||| TECH. Thanks to the automated tagging

 

Web links can surprise

 

Samantha Gross | AP Writer

NEW YORK – It wasn't what anyone expected to see while perusing a news article.
But there, in the final paragraph of an online story about the call girl involved in the Eliot Spitzer scandal, Yahoo's automated system was inviting readers to browse through photos of underage girls.
Yahoo Shortcuts, which more frequently offers to help readers search for news and Web sites on topics like "California" or "Presi-dent Bush," had in this case highlighted the words "underage girls."
Readers who passed their mouse over the phrase in The Associated Press story were shown a pop-up window with an image from Flickr, Yahoo's photo-sharing Web site.
Clicking on the pop-up window produced several pictures with nothing untoward, although some of the captions claimed that attached photos showed underage drinking.
But it also yielded more-disturbing results: hundreds of images, including some of a girl or woman in pigtails, knee socks and lingerie. One photo showed a faceless female body, naked.
The misstep, which happened in early July, was noted on a technology blog.
Editors at AP contacted Yahoo Inc., where a spokeswoman said the company quickly removed the link. Several of the more provocative photos were apparently taken off of Flickr.
The phrase "underage girls," now added to a list of thousands of previously blocked terms, will never again generate a Yahoo Shortcut, the company said.
But the incident highlights how difficult it can be for publishers to keep a tight rein on their sites in this age of user-generated content.
Internet publishers are increasingly relying on automated systems to tag phrases of interest and, in some cases, to provide links to other sites. With legions of YouTube users, Flickr photographers and anonymous bloggers posting floods of their own, largely unsupervised material, it's impossible for publishers using automation to exercise total control.
"No matter how sophisticated you make these automated systems, you're not going to make them perfect, and all you can really strive for is to tune them as you go along," said Lauren Weinstein, co-founder of People for Internet Responsibility.
Still, he said, in this case "it's pretty clear there was a lapse in terms of the quality control of Ya-hoo's keyword list."
It's unclear how, exactly, "underage girls" was selected as a useful link. Yahoo Shortcuts "leverages a combination of algorithmic and editorial processes to identify current, relevant and popular terms," said spokeswoman Meagan Busath.
Among the factors the system considers: terms entered into Yahoo's search engine.
That raises the unsavory prospect that "underage girls" could be among the most popular searches on Yahoo, said Chris Sherman, executive editor of the industry Web site Search Engine Land.
But he said it's more likely a combination of factors was at play. Perhaps a similar phrase is a popular search term, or perhaps the exploitation of young women has become a hot news topic.
The selection of the phrase could also have been driven by its relevance to the story at hand – after all, the AP article was about how Ashley Dupre had dropped a lawsuit that claimed she was underage when she appeared in a "Girls Gone Wild" video.
If the system was merely checking whether Flickr had a sufficient number of relevant results, the answer apparently would have been yes.
Although Busath notes that Flickr users and employees monitor the site's content and report problematic images, a search of the site for the words "underage girls" turned up 428 photos.
Any technology has its hiccups as engineers refine it, and over the years automated content has occasionally offered offense.
In one recent flub, a Yahoo photo collection about Osama bin Laden began with a picture of Sen. Barack Obama.
There was nothing wrong with the programming (the senator had been at a hearing about the al-Qaida leader, and his photo was the most recent in the collection), but Yahoo rewrote its programming code to block the same thing from happening again, a spokesman said. |||
For those sites considering expanding their use of the technology, the biggest question may be whether Web users will embrace the presence of more links or whether they'll become fed up with something that could become as annoying - and as irrelevant to them - as pop-up ads. |||