Wikia search engine gains a Web crawler

Jimmy Wales announced a significant component for the Search Wikia open-source search engine: the Grub Web crawler

Wikia's project to develop an open-source search engine got another boost with the acquisition of the Grub distributed Web crawler, the company announced Friday.

Wikia acquired Grub from LookSmart and released it under an open-source license, adding a significant component to Search Wikia, scheduled to debut in this year's fourth quarter.

The Search Wikia project seeks to create a search engine based on open-source search protocols and human collaboration, drawing from the concept of the Wikipedia online encyclopedia, which is written and edited by a community of volunteer collaborators.

As such, it will provide a better search experience than the ones offered by commercial, proprietary search engines like those from Google Inc., Yahoo and Microsoft, said Jimmy Wales, Wikia's co-founder and chairman, and Wikipedia founder.

However, people shouldn't expect Search Wikia's first version to deliver on this lofty goal. "When I opened Wikipedia with only three articles in it, it was a pretty bad encyclopedia," Wales said. "That's where we're going to be in December with the search engine: We'll tell people upfront: 'It's not very good yet.' But we'll open it up to get feedback and community involvement to help us make it better."

Wikia, which has about 35 full-time employees, is developing its own relevancy technology to rank search results, he said. At this point, it's not clear whether at launch Search Wikia will offer only general Web searching or if it will have specialty engines for things like images and news.

Visually, Search Wikia will have the standard search-engine interface. "The differences will be that at various points all along that search process, there'll be opportunities for people to engage with the community and join and participate in the construction of the search results," Wales said.

Since Search Wikia's announcement in December, an increasing number of organizations, such as search and online advertising player LookSmart, have expressed interest in the project, Wales said. In the coming months, Wikia will provide more details about third-party partnerships and support.

"A lot of second-tier [search] players understand that competing with Google directly as independent proprietary projects they'll never catch up because they don't have enough individual resources. But by banding together using open-source software, they can effectively compete with Google and improve their services that way," Wales said.

In fact, Wales believes that supporting Search Wikia would be in the best interest not only of second-tier search engines but also of search leaders, including Google. Search isn't a "defensible business" because it's very easy for people to switch among providers, so just like Google unseated AltaVista, another large competitor could emerge, he said. Thus, democratizing search in the way Search Wikia will attempt to do is a good thing for Google.

"Google will be much better off if search becomes ubiquitous and there are lots of players because that doesn't threaten their dominance in the advertising market," Wales said. "Google is much better off with lots of players and being the ad broker for everyone, because ad brokerage is a defensible business."

Allowing users to participate in the design and index of a search engine isn't new. For example, Yahoo's popular Answers search engine lets users pose and answer questions, while Inc. recently launched a search engine whose index is handcrafted by people, not automated Web crawlers. Social bookmarking sites like Yahoo's Delicious compile users' favorites Web site links and makes the links collection searchable by other users. There is even an open-source search engine project called Nutch.

"There's definitely a place for social search and human editorial input into search results," said industry analyst Greg Sterling from Sterling Market Intelligence.

However, so far the efforts haven't had a significant impact on the overall search market, he said. "If you can do something to really improve the quality of the search results, or if you present results in a radically new interface, or do a mix of those two, you might have shot of getting some attention," Sterling said.

If anyone can pull it off, it's Wales, who proved against all odds that an open, free, volunteer-backed encyclopedia could achieve the level of success Wikipedia has, Sterling said. "Wikipedia does suggest there is a probability of success in this," Sterling said.

Search Wikia and Wikipedia operate under two separate organizations that were both founded by Wales: Wikia is a commercial, for-profit company while The Wikimedia Foundation that oversees Wikipedia is a nonprofit entity. Wikia and Wikimedia operate independently of each other. Search Wikia is one of several projects and services at Wikia.

Join the newsletter!

Error: Please check your email address.
Rocket to Success - Your 10 Tips for Smarter ERP System Selection
Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Juan Carlos Perez

IDG News Service
Show Comments


James Cook University - Master of Data Science Online Course

Learn more >




Back To Business Guide

Click for more ›

Brand Post

Most Popular Reviews

Latest Articles


PCW Evaluation Team

Andrew Teoh

Brother MFC-L9570CDW Multifunction Printer

Touch screen visibility and operation was great and easy to navigate. Each menu and sub-menu was in an understandable order and category

Louise Coady

Brother MFC-L9570CDW Multifunction Printer

The printer was convenient, produced clear and vibrant images and was very easy to use

Edwina Hargreaves

WD My Cloud Home

I would recommend this device for families and small businesses who want one safe place to store all their important digital content and a way to easily share it with friends, family, business partners, or customers.

Walid Mikhael

Brother QL-820NWB Professional Label Printer

It’s easy to set up, it’s compact and quiet when printing and to top if off, the print quality is excellent. This is hands down the best printer I’ve used for printing labels.

Ben Ramsden

Sharp PN-40TC1 Huddle Board

Brainstorming, innovation, problem solving, and negotiation have all become much more productive and valuable if people can easily collaborate in real time with minimal friction.

Sarah Ieroianni

Brother QL-820NWB Professional Label Printer

The print quality also does not disappoint, it’s clear, bold, doesn’t smudge and the text is perfectly sized.

Featured Content

Product Launch Showcase

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?