Does Google only process the main pages? Because whenever I search, the only pages that will show up from my website are the main pages, and not any sub sections, and that ticks me off because that's where all the content is xD
This can depend a lot on the software that powers your site, and the format of your URLs..
Way back when, Google had problems with & and ? characters in URLs (and some other search engines still do).
Because of bad coding (or just thoughtless coding), many of these URLs are incorrectly passed on to databases. These characters may require escapes in some languages (such as ' and " sometimes needing to be changed to \' and \" to be passed through PHP).
Google seems to have gotten around this one now, and apparently has for a while.
Another thing to watch out for is sessions.
PostNuke, for example, is extremely difficult to get way up there in search engines (especially Google). I was on the PN development & support teams for a short spell in '02, so I know many people running PostNuke sites. A few of them have much more content than I do on my site, yet Google only knows of a handful of pages on their sites, while it knows of over 3,000 on mine.
The problem is that PostNuke (and some other scripts, including osCommerce, phpBB and vBulletin) passes along the session in such a way that, when Google crawls the site, the session ID gets appended to the URL.
As Google, like pretty much any automated search-engine/web-sucker, doesn't use sessions, a new one is created each time a page is loaded. Google could reload the same page 1,000 times and never see the same URL twice.
This can often be gotten around by placing the following line in a .htaccess file in your website's root directory..
php_flag session.use_trans_sid off
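To see why the session ID is such a problem for a crawler, here is a rough Python sketch. The example.com URLs and the session values are made up, and PHPSESSID is only PHP's default session name (your script may well use a different one), so treat this as an illustration rather than how any search engine really works.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

def strip_session_id(url, session_param="PHPSESSID"):
    """Return url with the session parameter removed from its query string."""
    parts = urlsplit(url)
    kept = [(key, value)
            for key, value in parse_qsl(parts.query, keep_blank_values=True)
            if key != session_param]
    return urlunsplit(parts._replace(query=urlencode(kept)))

# Two hypothetical crawler visits to the same page, each handed a fresh session ID.
visit_1 = "http://www.example.com/modules.php?name=News&PHPSESSID=aaa111"
visit_2 = "http://www.example.com/modules.php?name=News&PHPSESSID=bbb222"

# As raw strings the two visits look like two different pages.
print(visit_1 == visit_2)

# With the session ID taken out, they turn out to be the same page.
print(strip_session_id(visit_1) == strip_session_id(visit_2))
```

Turning the session ID off in the URL, as the .htaccess line does, means the crawler only ever sees the second, stable form.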
Something else I have done is to use Apache's rewrite mod (mod_rewrite) to fake some prettier URLs. I did this partly because of the previously mentioned problem with & and ? characters in URLs, partly to simply shorten the URLs so they don't wrap in E-Mails, and for a couple of other reasons. Some scripts, such as osCommerce, have built-in support for masking real URLs with more search-engine-friendly ones.
With PostNuke, there was no native support for this, so I had to do a bit of hacking with the theme, and some of the core code/modules to get this working as much as possible.
To give you an example of this... here is PostNuke's default URL for the "News" module.
http://www.reptilerooms.com/modules.php?op=modload&name=News&file=index (Google Page Rank 0/10)
As you can see, this is quite long and ugly, and to a search engine isn't really obvious (and the URL of the page itself - including the domain name, and page's filename - CAN make a difference with regard to how high up the search it shows). Also, if somebody were to paste this URL in an E-Mail, it would wrap in a text based E-Mail (as many of the Reptile related Yahoo groups I'm on are), meaning two, sometimes more, lines of text would have to be copied+pasted into notepad or some other text editor and patched up to form a complete working URL.
With my modifications in place, this URL is altered to show as...
http://www.reptilerooms.com/News+main.html (Google Page Rank 5/10)
As you can see, this is exactly the same page. But as you can also plainly see, Google likes it MUCH more (which is obvious by the page rank it's given it - although, the fact that there are several thousand other pages on my site linking to the fake URL rather than the actual URL might have something to do with that as well).
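The idea behind the fake URL can be sketched in a few lines of Python. This is not the actual rewrite rule or PostNuke hack used on the site (neither is shown in this post); the "<Module>+<page>.html" pattern and the "main" to "index" alias are guesses made purely from the two example URLs above.

```python
import re

# Hypothetical pattern for friendly paths such as "/News+main.html".
FRIENDLY = re.compile(r"^/(?P<module>[A-Za-z0-9_]+)\+(?P<page>[A-Za-z0-9_]+)\.html$")

# Assumed alias: the friendly name "main" stands for the module's "index" file.
PAGE_ALIASES = {"main": "index"}

def to_real_url(path):
    """Translate a friendly path into the long modules.php URL, or None if it doesn't match."""
    match = FRIENDLY.match(path)
    if match is None:
        return None
    page = PAGE_ALIASES.get(match.group("page"), match.group("page"))
    return f"/modules.php?op=modload&name={match.group('module')}&file={page}"

print(to_real_url("/News+main.html"))
```

The visitor (and the search engine) only ever sees the short form, while the script behind it still receives the long query string it expects.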
As others have stated, CONTENT IS KEY with regard to getting way up there! While there are sites out there related to reptiles that have much more content than we do, most of our content is unique: research & articles that we have written ourselves.
For example, the Yellow Fungus Disease article was entirely researched by us. Nobody else has done research into this disorder and published it on the web, so all of the other people who want to inform others about YFD are linking to us. The news about the new 9th species of bearded dragon (again, note the friendlier URL) that was discovered was an exclusive article, as we know the gentleman who discovered it personally. The announcement about the discovery of a clutch of albino bearded dragons that have actually survived more than a few days (something that has never before happened) was news that we got out there first. Because we got it first, many others didn't even bother to mention it, and if they did, they linked to us, because we spoke with the person who owns this clutch and got photos that nobody else has.
And lastly, a little more than a week ago, a zoo in Indonesia claimed that they were in possession of a 49ft reticulated python. If proven true, this would be the world's longest ever recorded snake in captivity by a clear 18ft (the current record being 31ft, give or take). As soon as I found out about this, I put up a news posting on my site. A week later, when this was proven to be a hoax and the snake was discovered to be closer to 21ft, I put up another news posting as soon as I had a reliable source to quote. I also linked to these articles in my Reticulated Python care/info sheet (which has had more than 200 reads in the past week solely because people have been searching for "world's largest snake" and "49ft python" on Google, when other care sheets put up since then have averaged maybe 30-40 reads).
So, not only is content key, but also keeping that content fresh. If something comes up in the news related to your site's content or theme, mention it. Link it to other articles on your site, and have those other articles link to your news post about it.
There are many other modifications I have made to PostNuke in order to clean it up for Search-Engine-Happiness, but if I got into those, then this would turn into a real long post pretty quickly (oops, too late, hehe).
The main things to remember when creating a site with search engines in mind:
CONTENT - Keep it LARGE (with the most important bits at the top, and use key search words several times), keep it ORIGINAL wherever possible (try not to replicate content that 100 other people have on their sites), and keep it FRESH (make sure it stays up-to-date! The more often Google notices changes in your content, the more often it will spider your site and find new pages, and update your rankings).
TITLE - USE those title tags. Don't just have <title>Bill's Website</title> on every page. Of course, name recognition is important, so feel free to have that on each page along with a descriptive topic of the page's content. For example..
<title>Bill's Website - Google Ranking Tips</title>
Or, often better..
<title>Google Ranking Tips - Bill's Website</title>
Having the description before the name of your site can make all the difference (especially if your site name is long, and Google truncates it to...)
Bill's Website - Goog... (who's gonna click on that?)
Google Ranking Tips - Bi... (oooh much more enticing)
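Here is a small Python sketch of that truncation effect. The 21-character limit is an arbitrary number picked for the example, not a real Google cutoff, and truncate is just a hypothetical helper.

```python
def truncate(title, limit=21):
    """Cut title to limit characters, adding '...' when something was dropped."""
    if len(title) <= limit:
        return title
    return title[:limit] + "..."

site = "Bill's Website"
topic = "Google Ranking Tips"

# Site name first: the topic is what gets chopped off.
print(truncate(f"{site} - {topic}"))

# Topic first: the whole topic survives and only the site name is lost.
print(truncate(f"{topic} - {site}"))
```

Whatever the real cutoff happens to be, the part you put first is the part most likely to survive it.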
DOMAIN NAME & PAGE FILENAMES - These are just as important as content & title. With identical content,
http://www.joespersonalsite.com isn't going to rate as highly as
http://www.reptilerooms.com in a reptile related search (usually).
index.php?something=whatever&this=that&somethingelse=what&is=this&stupidly=long&url=for
is not going to be as prominent as
descriptive-title.html
Even if it's just one or two words relating to the subject, that's what's important.
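If your pages have titles, one simple way to get filenames like that is to derive them from the title. A minimal Python sketch, assuming a made-up slugify helper and a plain lowercase-and-hyphens convention (not what any particular script does):

```python
import re

def slugify(title):
    """Lowercase title, collapse anything non-alphanumeric into hyphens, add .html."""
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")
    return slug + ".html"

print(slugify("Descriptive Title"))
print(slugify("Reticulated Python Care Sheet"))
```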
If you go to Google right now, and search for "reptile webcam", guess who comes up in the #1 spot? Yup, it's me.
That's because instead of pointing to some impossible-to-decrypt URL, it points to the seemingly harmless (yet descriptive) URL Webcam-main.html.
Having the keywords in the URL REALLY does help.
Now, if I've not bored you all to sleep by now, let me state that these are simply my own personal observations through working on my site since it was switched to PostNuke in April '02.
It may simply be pure coincidence that my site's popularity has jumped as each of my changes has been implemented, and you may not receive the same results I have, but hopefully this will help somebody out there.
PS. Please forgive any mistakes, it's 6:27am, I've been pulling an all-nighter working on my site (probably like the rest of you) and only stumbled across this forum a couple of hours ago, so I'm a little tired.
Please feel free to criticize ANYTHING I've said in this post. Like any good (or mad, as my girlfriend claims) scientist, being proven wrong can be just as satisfying as being proven right, and I'm not completely closed off to the idea that there may be things I haven't fully figured out yet.
