Friday, February 8, 2008

Importance of Robots.txt

Robots.txt

A robots.txt file is very important if you want a good ranking on search engines, yet many websites don't provide one. It helps keep out unwanted spiders such as email retrievers and image strippers by defining which paths are off limits for spiders to visit. This is useful for keeping well-behaved spiders away from pages you would rather not have crawled, but the file itself is publicly readable, so it does not actually hide personal information or secret files.

What is Robots.txt

The robots.txt file is a special text file that is always located in your web server's root directory. It contains restrictions for web spiders, telling them where they have permission to search. In effect, it defines rules for search engine spiders (robots) about what to follow and what not to. Note that web robots are not required to respect robots.txt files, but most well-written web spiders follow the rules you define.
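To make the idea concrete, here is a minimal sketch of how a well-behaved spider might consult the rules before fetching anything. It uses Python's standard urllib.robotparser module. The robot name "ExampleBot", the example.com URLs, and the helper name permitted_urls are assumptions made up for illustration, and the rule syntax itself is explained in the sections that follow.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules and robot name, used only for this illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
"""
ROBOT_NAME = "ExampleBot"


def permitted_urls(robots_txt, robot_name, urls):
    """Return the URLs that the given robots.txt text lets robot_name fetch."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if parser.can_fetch(robot_name, url)]


if __name__ == "__main__":
    candidates = [
        "http://www.example.com/index.html",
        "http://www.example.com/cgi-bin/search.cgi",
    ]
    print(permitted_urls(ROBOTS_TXT, ROBOT_NAME, candidates))
```

A spider that skips this check can still request any URL it likes, which is why the file only works against robots that choose to cooperate.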

How to Create Robots.txt

The robots.txt file has a specific format. It consists of records, and each record consists of two parts: a User-agent line and one or more Disallow: lines. Each line has the format:
<Field> ":" <value>

The robots.txt file should be saved with Unix line endings! Most good text editors have a Unix mode, or your FTP client *should* do the conversion for you. Do not use an HTML editor to create a robots.txt file unless it specifically has a text mode.
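If you generate the file from a script rather than an editor, you can force the line endings yourself. The following is a small sketch in Python, assuming you simply want every line to end with a bare LF; the helper name write_robots_txt and the sample rules are made up for illustration.

```python
import os
import tempfile


def write_robots_txt(path, lines):
    """Write the given lines to path, ending each one with a bare LF."""
    # newline="\n" stops Python from translating "\n" to "\r\n" on Windows.
    with open(path, "w", encoding="ascii", newline="\n") as handle:
        for line in lines:
            handle.write(line + "\n")


# Hypothetical rules, used only for this illustration.
RULES = ["User-agent: *", "Disallow: /cgi-bin/"]

if __name__ == "__main__":
    # Write to a temporary directory so the demo does not touch a real site.
    target = os.path.join(tempfile.mkdtemp(), "robots.txt")
    write_robots_txt(target, RULES)
    with open(target, "rb") as handle:
        print(handle.read())
```

Reading the file back in binary mode, as the demo does, is a quick way to confirm that no carriage returns slipped in.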

User-agent

The User-agent line specifies the robot. For example:
User-agent: googlebot

You may also use the wildcard character "*" to specify all robots:
User-agent: *

You can find user agent names in your own logs by checking for requests to robots.txt. Most major search engines have short names for their spiders.
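As a rough sketch of that log check, the Python snippet below counts the User-Agent strings seen on requests for robots.txt. It assumes your server writes the common "combined" access log format; the sample log lines, the IP addresses, and the function name robots_txt_agents are invented for illustration, so adjust the pattern to whatever your own logs look like.

```python
import re
from collections import Counter

# Matches a combined-log-format line whose request is for /robots.txt and
# captures the final quoted field, which is the User-Agent header.
ROBOTS_REQUEST = re.compile(
    r'"(?:GET|HEAD) /robots\.txt(?:\?[^ "]*)? HTTP/[\d.]+" '
    r'\d{3} \S+ "[^"]*" "([^"]*)"'
)


def robots_txt_agents(log_lines):
    """Count the User-Agent strings that requested /robots.txt."""
    agents = Counter()
    for line in log_lines:
        match = ROBOTS_REQUEST.search(line)
        if match:
            agents[match.group(1)] += 1
    return agents


# Made-up sample lines, for illustration only.
SAMPLE_LOG = [
    '192.0.2.1 - - [08/Feb/2008:10:00:00 +0000] "GET /robots.txt HTTP/1.1" '
    '200 68 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '192.0.2.2 - - [08/Feb/2008:10:00:05 +0000] "GET /index.html HTTP/1.1" '
    '200 5120 "-" "Mozilla/5.0"',
    '192.0.2.3 - - [08/Feb/2008:10:01:00 +0000] "GET /robots.txt HTTP/1.0" '
    '200 68 "-" "Roverdog/1.0"',
]

if __name__ == "__main__":
    for agent, hits in robots_txt_agents(SAMPLE_LOG).items():
        print(hits, agent)
```

The name to put on a User-agent line is usually the short token at the start of the logged string, such as the part before the slash.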

Disallow

The second part of a record consists of Disallow: directive lines. These lines specify files and/or directories. For example, the following line tells spiders that they may not download contactinfo.htm:
Disallow: /contactinfo.htm
You may also specify directories:
Disallow: /cgi-bin/
Which would block spiders from your cgi-bin directory.

The Disallow directive matches by prefix. The standard dictates that /bob would disallow both /bob.html and /bob/index.html (neither the file bob nor the files in the bob directory will be indexed).

If you leave the Disallow line blank, it indicates that ALL files may be retrieved. At least one disallow line must be present for each User-agent directive to be correct. A completely empty Robots.txt file is the same as if it were not present.
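The matching rule is simple enough to sketch in a few lines of Python. This is only an illustration that assumes plain prefix comparison as described here, with a blank Disallow value blocking nothing; the function name is_disallowed is hypothetical, and real spiders may add their own extensions.

```python
def is_disallowed(path, disallow_values):
    """Return True if any non-empty Disallow value is a prefix of path."""
    for value in disallow_values:
        # A blank Disallow value blocks nothing, so skip it.
        if value and path.startswith(value):
            return True
    return False


if __name__ == "__main__":
    for path in ["/bob.html", "/bob/index.html", "/alice.html"]:
        print(path, is_disallowed(path, ["/bob"]))
```

Because the comparison is a prefix test, a rule of /bob/ with a trailing slash would block only the directory and leave /bob.html reachable.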

White Space & Comments

Any line in robots.txt that begins with # is treated as a comment. The standard allows comments at the end of directive lines, but this is really bad style:
Disallow: /bob #comment

Some spiders will not interpret the above line correctly and will instead attempt to disallow "/bob#comment". The moral is to place comments on lines by themselves.
White space at the beginning of a line is allowed, but not recommended:
   Disallow: /bob #comment
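For comparison, here is a minimal sketch of how a tolerant parser might read a single line, stripping a trailing comment and any surrounding white space before splitting the field from its value. The function name parse_directive is made up for illustration, and this does not describe how any particular spider is implemented.

```python
def parse_directive(line):
    """Return (field, value) for a directive line, or None for blanks/comments."""
    # Drop everything from the first "#" onward, then surrounding white space.
    line = line.split("#", 1)[0].strip()
    if not line or ":" not in line:
        return None
    field, value = line.split(":", 1)
    # Field names are compared case-insensitively, so normalize to lowercase.
    return field.strip().lower(), value.strip()


if __name__ == "__main__":
    print(parse_directive("Disallow: /bob #comment"))
    print(parse_directive("   Disallow: /bob"))
    print(parse_directive("# a comment on its own line"))
```

A spider that skips the comment-stripping step is exactly the kind that ends up with the comment text inside the path.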

Examples

The following allows all robots to visit all files because the wildcard "*" specifies all robots.
User-agent: *
Disallow:

This one keeps all robots out.
User-agent: *
Disallow: /

The next one bars all robots from the cgi-bin and images directories:
User-agent: *
Disallow: /cgi-bin/
Disallow: /images/

This one bans Roverdog from all files on the server:
User-agent: Roverdog
Disallow: /

This one keeps googlebot from getting at the personal.htm file:
User-agent: googlebot
Disallow: /personal.htm
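You can try rules like these out before uploading them. The sketch below feeds three of the records above to Python's standard urllib.robotparser module. Combining them into one file is an assumption made for this illustration, as are the robot name "SomeOtherBot" and the helper name load_rules, and the result reflects Python's parser rather than any particular search engine.

```python
from urllib.robotparser import RobotFileParser

# Three of the example records, combined into one hypothetical file.
COMBINED = """\
User-agent: Roverdog
Disallow: /

User-agent: googlebot
Disallow: /personal.htm

User-agent: *
Disallow: /cgi-bin/
Disallow: /images/
"""


def load_rules(robots_txt):
    """Build a parser from robots.txt text without touching the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


if __name__ == "__main__":
    rules = load_rules(COMBINED)
    checks = [
        ("Roverdog", "/index.html"),
        ("googlebot", "/personal.htm"),
        ("googlebot", "/images/logo.gif"),
        ("SomeOtherBot", "/images/logo.gif"),
        ("SomeOtherBot", "/index.html"),
    ]
    for robot, path in checks:
        print(robot, path, rules.can_fetch(robot, path))
```

Note that a robot with its own record follows that record instead of the "*" record, so in this combined file googlebot is kept out of personal.htm but not out of the images directory.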

Tuesday, January 8, 2008

seo specialists hyderabad seo training centers hyderabad


What is SEO?
SEO (Search engine optimization) is simply "the use of search engines to draw traffic to a web site. It is the technique of attaining a higher ranking in search engines and directories via changes to a site to make it more search engine compatible".
How to choose the Right Domain Name?
Consider naming your company and registering a domain name starting with the digit 1. Better still, choose a name starting with "1st". Why? When people create directories of web sites, they have to decide how they are going to classify those web sites. One way to classify web sites is to list them on the basis of how "good" they are.
Why Google Banned Websites?
First, let me show you how to check whether you have actually been banned by Google. Oftentimes people think they've been banned when in reality they've just dropped in ranking and can't find their website.
11 Basic Ways for Website Promotion
The most important strategy is to rank high for your preferred words on the main search engines in "organic" or "natural" searches (as opposed to paid ads). Search engines send robot "spiders" to index the content on your webpage, so let's begin with steps to prepare your webpages for optimal indexing and website promotion.
Keyword & its Types & Keyword Density & its importance.
A word used by a search engine in its search for relevant web pages is called a keyword. Keyword tags are an important element in all web pages, and they are one of the best ways of optimizing the number of visitors on your site.
Importance of Robots.txt
A robots.txt file is very important if you want a good ranking on search engines, yet many websites don't provide one. It helps keep out unwanted spiders such as email retrievers and image strippers by defining which paths are off limits for spiders to visit. This is useful for keeping well-behaved spiders away from pages you would rather not have crawled, but the file itself is publicly readable, so it does not actually hide personal information or secret files.
Pay Per Click (PPC)
Pay-per-click advertising is one of the most cost-effective methods of getting leads known to Internet business owners. It gives you instant traffic, and allows you to test your business model in real time.
What is Google Page Rank?
Google Page Rank is simply Google's way of displaying how important a webpage is. Google assumes that when 1 webpage links to another webpage, it's actually "casting a vote" for the webpage. The more votes you have for your webpage, the more important your webpage will be.
How to Calculate Page Rank?
Now let's talk about "almost" EXACTLY how page rank is calculated. Over 99% of the webmasters on the internet do not understand how Page Rank(PR) is calculated. I'm going to refer to Page Rank as PR throughout the rest of this article, for the sake of not having to type it over and over.
Web directory & its importance
A web directory is a collection of links broken down into relevant categories. Think Yahoo! and their directory, the Open Directory Project or even the Google Directory (which, incidentally, is pulled from the ODP).
What is a Blog?
In 2003 blogging was a novelty. In 2004 it became a trend. Today, having a blog is a business necessity. Whether you are looking to voice your opinion on an issue you feel strongly about, or you're looking to explore your creative side, or promote yourself as an expert in your field, having a blog has fast become one of the prime markers of status and business nous.
What does Google really want?
Why Google, you ask? What a stupid question, you might also ask. Well, let me explain myself. However, while I do so, keep this question in mind and try to answer it alongside me.
What is the Google Sandbox Theory?
There are several theories that attempt to explain the Google Sandbox effect. Essentially, the problem is simple. Webmasters around the world began to notice that their new websites, optimized and chock full of inbound links, were not ranking well for their selected keywords.
What is RSS?
RSS stands for “Rich Site Summary”, although other terms such as “RDF Site Summary” (which emphasizes the file format) and “Really Simple Syndication” (which highlights the main selling point of RSS) are also useful in defining RSS by the book.
Importance of Sitemap
Building a perfect sitemap is a lot like building a perfect website - you need to account for contextual grouping of your pages, hierarchical linking between your pages, and most importantly, a clean, concise format that provides search engine spiders with a super-fast blueprint for indexing your website.
How to increase your Website Traffic?
This question is very common, so here SeoMaterial.com briefly describes some common ways to increase your website traffic.
What is over optimization?
Over-optimization happens when your website is considered “too good” by Google – either in terms of a sudden volume of backlinks, or because of heavy on-page optimization. In other words, if Google considers that your website optimization is beyond acceptable limits, your website will be red-flagged and automatically restricted or penalized.

Search Engine Optimization (SEO) can be stated as a highly specialized process of building a successful website. We say successful because if a commercial website cannot be found in the major search engines, it is not successful, it just isn't doing its job.

Search engine optimization (SEO) and search engine marketing (SEM) are now a growing part of the main marketing objectives for companies of all sizes.

Search Engine Optimization is just one part of an online marketing strategy, but it is the fundamental part. Search engine placement and keyword-related marketing can account for 85 to 95 percent of your overall Web traffic. Our team of Search Engine Optimization experts knows how to significantly improve your company.

Our leading-edge technology and our search engine optimization techniques are currently used by companies that want to fully maximize their Search Engine Optimization, while significantly boosting their Web visibility in the major search engines.


Adding relevant titles, descriptions, character formatting and keywords.
Adding a Site Map to your web site.
Google Page Rank (PR).
Links Exchange Strategies
and more…
