Chris Richardson of WebProNews attended this afternoon's Successful Site Architecture session, and sent some tips along exclusively to WPN readers.
Barbara Coll, CEO of Webmama.com and a former chair of SEMPO, led the session on Successful Site Architecture, a critical aspect of getting one's site prepped for search engine optimization.
Coll stated a goal of getting a site ranked highly in organic search, with the idea of converting visitors into customers. That SEO process has to be considered from the beginning of the site design, and adjusted later once the site is up and running.
Having developers on board who can make your site sit up and be noticed helps, but Coll suggests that developers versed in SEO are the ones a site owner wants sitting at a workstation on the project. Good practices implemented during design will carry through the life of a site.
SEO has to be embraced by the whole organization, from the CEO to the developers and content providers. It's an ongoing process, not just a destination.
Derrick Wheeler, Director of Search Optimization for Digital Impact, spoke next. He touched on the concept of knowing the little things about one's site, from the domains and sub-domains in use to the most important keywords and phrases in play.
Wheeler emphasized the importance of using robots.txt files to keep search engine spiders from hitting "spider traps" on sites that dynamically generate pages. Robots.txt can be used to keep spiders from wasting cycles on those areas of a site and direct them to your optimized content pages.
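As a rough illustration of the robots.txt advice, the Python sketch below parses a small robots.txt and checks which URLs a spider may fetch. The rules, the `ExampleBot` user agent, and the example.com URLs are all hypothetical and were not part of the session.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: these paths stand in for dynamically
# generated "spider trap" areas and are assumptions for illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /calendar/
Disallow: /search
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Load robots.txt rules from a string instead of fetching them."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# An endlessly paginated calendar is a classic trap, so it is blocked.
blocked = parser.can_fetch("ExampleBot", "http://www.example.com/calendar/2005/08/")

# An optimized content page stays open to the spider.
allowed = parser.can_fetch("ExampleBot", "http://www.example.com/products/widgets.html")
```

Running a check like this against a site's real robots.txt is one way to confirm that the rules block only the areas intended.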
Session IDs, cookies, and JavaScript can hinder spidering. Robots.txt can be set up to turn off session IDs for spiders. And when redirecting visitors to parts of a site, use 301 redirects. Wheeler also reiterated the point that spiders like text links for navigation best; too much of a graphical focus on navigation may work fine for visitors, but spiders will get lost, and so will your site in the index.
Internal links should be short, as spiders may ignore overly long links, a potential sign of search engine spamming. File directories for the site should be somewhat shallow, three or four levels below root at most.
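The depth guideline is easy to audit. The sketch below counts directory levels beneath the root for a URL; the function names, the four-level limit taken from the figure above, and the rule that a final segment containing a dot is a file are assumptions made for illustration.

```python
from urllib.parse import urlparse

# Upper bound from the session's "three or four levels" guideline.
MAX_DEPTH = 4


def directory_depth(url: str) -> int:
    """Count directory levels below the site root in a URL's path."""
    segments = [s for s in urlparse(url).path.split("/") if s]
    # Heuristic: a final segment containing a dot is treated as a file
    # name rather than a directory.
    if segments and "." in segments[-1]:
        segments = segments[:-1]
    return len(segments)


def too_deep(url: str, max_depth: int = MAX_DEPTH) -> bool:
    """Flag URLs that sit more than max_depth directories below root."""
    return directory_depth(url) > max_depth


shallow = directory_depth("http://www.example.com/products/widgets/blue.html")
deep = too_deep("http://www.example.com/a/b/c/d/e/page.html")
```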
Peter Norvig from Google stepped up next, and the interest of everyone at the session perked up considerably. As might be expected, Norvig urged site owners to utilize Google Sitemaps to help get their pages indexed quickly.
He emphasized some old school design concepts, ones that the Google spider considers when crawling: keep HTML pages under 100 KB in size, eschew the use of splash pages, and don't use password protection like .htaccess for pages you want the spider to crawl.
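For readers who have not seen one, a sitemap is an XML file listing the URLs a site wants crawled. The Python sketch below builds a minimal one. It assumes the sitemaps.org 0.9 namespace, which may differ from the schema Google Sitemaps accepted at the time of the session, and the `build_sitemap` helper and example.com URLs are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace of the sitemaps.org protocol (an assumption here; check the
# search engine's current documentation before submitting a file).
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a minimal sitemap document listing the given URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for loc in urls:
        url_el = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as "&" in the text.
        ET.SubElement(url_el, "loc").text = loc
    return ET.tostring(urlset, encoding="unicode")


sitemap_xml = build_sitemap([
    "http://www.example.com/",
    "http://www.example.com/products/",
])
```

A production file would also carry an XML declaration and could include optional fields such as last-modified dates; this sketch leaves those out to stay short.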
Norvig reaffirmed Wheeler's contention on 301 redirect usage: use a 301 when moving a page, and serve a 404 page once a page has been removed.
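One way to read the redirect advice is that a page with a new home should answer with a 301, while a page with no home at all should answer with a 404. The Python sketch below encodes that reading; the `MOVED` and `LIVE` tables, the paths, and the `respond` function are hypothetical, and a real site would configure this in its web server.

```python
from http import HTTPStatus

# Hypothetical site state: pages that currently exist, and old paths
# mapped to their permanent new locations.
LIVE = {"/", "/products/"}
MOVED = {"/old-catalog.html": "/products/"}


def respond(path):
    """Return (status, headers) for a requested path."""
    if path in LIVE:
        return HTTPStatus.OK, {}
    if path in MOVED:
        # 301 tells spiders the move is permanent, so the new URL
        # should replace the old one in the index.
        return HTTPStatus.MOVED_PERMANENTLY, {"Location": MOVED[path]}
    # Nothing lives here and nothing replaced it.
    return HTTPStatus.NOT_FOUND, {}


moved_response = respond("/old-catalog.html")
missing_response = respond("/discontinued.html")
```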
Eytan Seidman from MSN Search kept it simple. Build great content, and unique content. Get feedback from your users, and keep analyzing your site for issues. Seidman emphasized that MSN Search will remove sites from the index if they seem to be more about optimization than content.
Spamming search engines, link farming, keyword stuffing, content cloaking, hiding text, and duplicating content on multiple domains will all draw the ire of MSN Search, as well as other engines, and get a site deleted from an index.