Linux News


20 -May -2012 - 23:20
Tuesday, 29 November 2011 16:26

Web Search by the people, for the people. YaCy.

By Greg Sayle

Start

I installed YaCy only 30 minutes ago and have just started playing with it. But I have to say, it certainly has some powerful tools for seeing what goes on behind my browser in the world of search.

It will take some time to work through all the features. I will post more details about YaCy.

My YaCy web search page in my Chrome browser

[Image: yacy-web-page]

My YaCy Server Access Grid in Chrome browser

[Image: yacy-greenlinux-access-grid]

My YaCy Event Terminal

[Image: yacy-terminal]

Installation of YaCy on Debian or Ubuntu

On Debian or Ubuntu, you can install the YaCy packages from the project's update server at:

http://debian.yacy.net

To use this server, add the following line to /etc/apt/sources.list:

deb http://debian.yacy.net ./

Then you can install YaCy (as root) with:

apt-get update
apt-get install yacy

After installation, the YaCy application path is:

/usr/share/yacy

The DATA path is:

/var/lib/yacy

When YaCy is installed this way, it starts automatically at OS startup and stops before shutdown.

You can also start and stop YaCy from the command line with:

/etc/init.d/yacy stop
/etc/init.d/yacy start
/etc/init.d/yacy restart

The YaCy web server runs on port 8090 by default. The administration pages are at

http://localhost:8090/
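If you want to confirm from a script that the server is actually up after a start or restart, a small check along these lines might help. This is only a sketch: the function name `yacy_is_reachable` is my own invention, the URL is simply the default mentioned above, and it only tests whether some HTTP server answers there, not whether that server is really YaCy.

```python
import urllib.error
import urllib.request


def yacy_is_reachable(url="http://localhost:8090/", timeout=3.0):
    """Return True if an HTTP server answers at `url`, False otherwise.

    Hypothetical helper: it does not verify that the server is YaCy.
    """
    # Skip any configured proxy, since this is meant as a local check.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a success status (e.g. 401).
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure and similar problems.
        return False


if __name__ == "__main__":
    print("YaCy reachable:", yacy_is_reachable())
```

If you changed the port in the YaCy settings, pass the matching URL as the first argument.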

YaCy is a free search engine that anyone can use to build a search portal for their intranet or to help search the public internet. When peers contribute to the world-wide peer network, the scale of YaCy is limited only by the number of users in the world, and the network can index billions of web pages. It is fully decentralized: all users of the search engine network are equal, the network does not store user search requests, and it is not possible for anyone to censor the content of the shared index. We want to achieve freedom of information through a free, distributed web search which is powered by the world's users.

[Image: yacy1-status]

Decentralization

Imagine if, rather than relying on the proprietary software of a large professional search engine operator, your search engine was run by many private computers which aren't under the control of any one company or individual. Well, that's what YaCy does! The resulting decentralized web search currently has about 1.4 billion documents in its index (and growing - download and install YaCy to help out!) and more than 600 peer operators contribute each month. About 130,000 search queries are performed with this network each day.

Search Engine Technology

YaCy is a complete search appliance with user interface, index, administration and monitoring. The following diagram shows its components: 

[Diagram: YaCy main components]

YaCy harvests web pages with a web crawler. Documents are then parsed and indexed, and the search index is stored locally. If your peer is part of a peer network, your local search index is also merged into the shared index for that network. When a search is started, results from the local index are combined with results from the global search index held by peers in the YaCy search network.
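To make the index-then-search idea a bit more concrete, here is a toy sketch in Python. It is not YaCy's code (YaCy is a far more complete system with its own data structures); the function names and example URLs are made up for illustration, and a "peer" here is just a dictionary in memory. It builds a tiny inverted index (word to set of URLs) and answers a query by combining local hits with hits from remote indexes.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents):
    """Map each word to the set of URLs whose text contains it."""
    index = defaultdict(set)
    for url, text in documents.items():
        for word in tokenize(text):
            index[word].add(url)
    return index


def search(word, local_index, remote_indexes):
    """Combine local hits with hits from remote peers, without duplicates."""
    word = word.lower()
    hits = set(local_index.get(word, set()))
    for remote in remote_indexes:
        hits |= remote.get(word, set())
    return sorted(hits)


# Hypothetical pages standing in for crawled documents.
local = build_index({
    "http://example.org/a": "YaCy is a peer-to-peer search engine",
    "http://example.org/b": "Debian packages for the search server",
})
remote = build_index({
    "http://example.net/c": "Decentralized web search with YaCy",
})

print(search("yacy", local, [remote]))
# → ['http://example.net/c', 'http://example.org/a']
print(search("debian", local, [remote]))
# → ['http://example.org/b']
```

A real engine also ranks results and handles multi-word queries; this sketch leaves both out to keep the local-plus-remote merge visible.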

Peer-to-Peer Networking

YaCy peers continuously exchange index fragments using a Distributed Hash Table. Index data can therefore reach the local peer even before a user query is submitted, but of course it is still loaded from the remote peer network too when needed. 
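For readers who have not met a Distributed Hash Table before, the sketch below shows the general idea of deciding which peer is responsible for which word. Please treat it as a generic consistent-hashing illustration, not as YaCy's actual DHT: the `HashRing` class, the `distribute` function, the use of SHA-256 and the peer names are all assumptions of mine.

```python
import bisect
import hashlib


def ring_position(key):
    """Hash a string to an integer position on the ring."""
    return int(hashlib.sha256(key.encode("utf-8")).hexdigest(), 16)


class HashRing:
    """Toy hash ring: each word belongs to the next peer clockwise."""

    def __init__(self, peers):
        self._ring = sorted((ring_position(p), p) for p in peers)
        self._positions = [pos for pos, _ in self._ring]

    def peer_for(self, word):
        """Return the first peer at or after the word's ring position."""
        i = bisect.bisect_left(self._positions, ring_position(word))
        # Wrap around to the first peer when we run past the end.
        return self._ring[i % len(self._ring)][1]


def distribute(index, ring):
    """Group index entries (word -> URLs) by the peer responsible for them."""
    fragments = {}
    for word, urls in index.items():
        fragments.setdefault(ring.peer_for(word), {})[word] = set(urls)
    return fragments


# Hypothetical peers and a tiny index to split into fragments.
ring = HashRing(["peer-a", "peer-b", "peer-c"])
index = {"yacy": {"u1"}, "search": {"u1", "u2"}, "debian": {"u3"}}
for peer, fragment in sorted(distribute(index, ring).items()):
    print(peer, sorted(fragment))
```

Because every peer computes the same hash for the same word, any peer can work out where an index fragment should go, or where to ask for it, without a central coordinator.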

[Diagram: YaCy DHT]
