|
Page 2 of 2
Which is better? Page views? Unique Visitors? Or Hits?
Understanding site statistics - Page 2
We have went over two different methods so far of evaluating web site
traffic but there is one more method used currently which is
independent or unique visitors. When a request is logged in the raw access log file it might look like this:
198.162.0.1 - - [09/Jul/2003:15:30:19 -0400] "GET /index.htm HTTP/1.1" 200 - "http://www.xtremewebhosts.com/index.php"
The first part of the line is the IP address of the computer which
requested the web page. At the end we can see that they requested the
index.php file. We can also see the date and time of the request in the
square brackets. We can also see that the code "200" (a successful
request and web page sent) was logged. (If the web page was not found
you would see an error code of 404 which means the web page wasn't
found. Another popular code is 304 which means that the web page has
not been modified since the visitor last visited.) The important part
is that whenever the computer located at the IP address of 198.162.0.1
requested a web page from the site their IP address was listed and
logged. If you went through the entire raw access log file and counted
up the "different" IP addresses you could find out a number of unique
IP addresses and have a rough estimate of the number of individual
people who visited your site. Counting these IP addresses will tell you
how many "different" people visited your site. So if we went back to
our examples and counted the number of different people who visited we
might find that the first site might have more unique people visiting
its site than the second one.
But, this is not a perfect method either. Simply counting up the IP
addresses might actually give you a lower (or higher) estimate of the
people visiting your site than in reality. This is because if you have
people who visit your site who use dial-up modems they are assigned
temporary IP addresses each time they dial in to the internet and this
would create a higher estimate of people visiting each time their IP
address changes. So if
you have one person who visits your site on one day and is assigned the
IP of 198.162.0.1 on one day and then assigned 198.162.0.5 on the next
day it will be logged as two different unique visitors when in reality
it was only one.
On the other hand, if you have someone visit from a computer
that has a static IP address and then they leave and someone else
visits from the same computer, this IP address will only be logged a
single time even though two or more people may have used it to access
your web site. Also, you might have one person dial in and use
one IP
to visit your site and then log off. And then have another person dial
in and be assigned the same IP and visit you. Either of these examples
will show up as one
unique IP because the IPs are the same even though there are two
different people visiting, and it will create a lower estimate of
people visiting your web site. As inacurate as these may seem, using
these methods however will give you a
generalized view of how many people visited.
For example if you viewed a quick snapshot statistics for xtremewebhosts.com you would find that the following was found:
There were 2,612 unique visitors who visited a total of 3,557 times who requested 11,236 pages and recorded 62,905 hits.
Which number is better? It really depends on what you want to know. We
can see that there were several thousand different IP addresses logged (give or
take some duplication or under estimating). We can also see that they
requested a little over 11,000 web pages (were some of these browser
refreshes?). And because of our site design there were a little over
62,000 individual requests in the raw access log file. So how do we
know which pages these people visited? And where did they come from?
How about how long did they stay on the site? And more importantly do I
have to count several thousand log entries in the file to determine all
of this? The answer to all of these questions lies within using a
graphical site statistics program. With a click of a button all this
information is at your reach. In our next article we will discuss three
of the most popular programs that will read your raw access log file
and provide you with information instantly (although our web hosting
plans have more than 3 of these types of programs). But before you
could understand the information coming from those programs you had to
understand the terminology displayed by these programs. And that is
what we have accomplished here.
None of these methods are perfect, but without knowing their flaws (and
their strengths) you cannot effectively tell what is happening on your
site. Each is useful in filling in one aspect of the picture and all the information they provide
should be taken with a grain of salt as all of these combined only show in
general how your site is doing, not exactly with 100% certainty of how many people are visiting.
For a really nice look at one of our clients visable statistics packages, have a look at http://www.maysvillebands.com/analysis.html which gives detailed information about who is visiting, page hits, page views, and demographics about its visitors.
|