http://google.stanford.edu/addurl
Google employs a concept of Page Rank derived from academic citation literature. Page Rank equates roughly to a page's importance on the Web: the more inbound links a page has, and the higher the importance of the pages linking to it, the higher its Page Rank.
Proximity (distance between query terms in the proximity) is taken into account on all queries.
If this is based on academic stuff, proper HTML will be absolutely essential.