Which Web Statistics Package Would You Use? 83
ken-doh asks: "We host about 200 customers web sites on a Windows platform, we want to provide them with a simple web statistics package, to track hits and other useful pieces of information. We have been using Deepmetrix LiveStats XSP which has been perfect for our customers, but since Microsoft purchased it, the product is no more, with support ending next year. So we need to buy a new stats package. Any ideas?"
awstats all the way (Score:5, Informative)
Re: (Score:2)
Re: (Score:1)
So very unfriendly. So very, very insecure.
I'd love to recommend webalizer, which excels in some of those ar
Re: (Score:2)
Re: (Score:1)
Now if you're using a non-standard approach, and allowing people to hit the awstats cgi directly, then you suffer from this issue. But that only really works on small sites, due to performance issues.
Re: (Score:1)
Also, don't click on referral links in the web logs!
The setup for multiple domains is a pain, but necessary with any other stats package I've used.
Re: (Score:2)
I'm not familiar with the package you're discussing, but anything that produces clickable links that it's dangerous to click on sounds to me like a disaster waiting to happen.
It's a potential information leak, that's all (Score:4, Insightful)
Re: (Score:2)
Sure, its free and open source, but I'll take useful over free any day.
Re: (Score:2)
Re: (Score:2)
Re: (Score:1)
Re: (Score:1)
Re: (Score:2)
Re: (Score:2)
why fix it? (Score:4, Insightful)
or is that not an option for some reason?
-nB
Re: (Score:1, Informative)
or is that not an option for some reason?
-nB
You missed something:
" Microsoft purchased it, the product is no more, with support ending next year."
Re: (Score:2)
-nB
on a completely unrelated note,
Re: (Score:1, Flamebait)
So it automatically quits working?
That's genuinely the advantage of running a Microsoft product :)
Re: (Score:3, Funny)
Re: (Score:3, Informative)
there's a couple of good tools (Score:4, Interesting)
I've seen lots of different packages and frankly I sometimes wonder why people pay for them. They're typically (actually I guess they're literally) off-the-shelf stuff that, while offering nice and interesting features, don't cover everything for everybody. I think it's a "you get what you pay for mentality", i.e., people insist on buying packages to do this kind of analysis.
I've written probably more than 20 different web filters for various analyses because the OTS stuff didn't get me the info I wanted.
And for any more-than-small IT staff, there's always someone there who knows the tools, and can slap together stat info and tweak it ad nauseum until management sees the analysis they think they want. Lots of staff will even write it on their own time -- they like to tinker with that stuff.
Also, though I haven't looked, I'll bet there are some great CPAN modules that get you what you want as a good start with the added benefit of having the code for your own tweaking.
Considering the article specifically is asking for simple web stats, I think sed, awk, perl, and others is a perfect way to go.
Or, you could buy yet another package and risk Microsoft buying that product and disappearing it.
Re: (Score:2)
This would not give you statistics that are probably a lot more important, such as:
o Conversion rates for your advertising (are we getting more out of advertising with company X than we pay for it?)
o What yo
Re: (Score:2)
Be careful. I work in an environment that was full of those kinds of things. Almost none of the admins or programmers stuck to any standards, naming schemes, or left any documentation. Most didn't even speak to one another. Thus, when I came onboard, I had to deal with a statistics collection system in multiple pieces in several different languages; many at the same time (a bash script that ran Perl scripts AND some compiled C modules on Cygwin on
one to avoid (Score:3, Informative)
Re: (Score:2)
I do use SmarterMail, and while the regular mail filters are neat, their spam filters are horrible! At least 10x more spam gets through than the default CPanel SpamAssassin (which is not so great compared to a nicely tuned one).
I didn't try this in the current version, but in a previous version, if I tried to erase a hundred messages through their web interface, the server would stop re
Re: (Score:2)
My issues with Analog are that I haven't discovered how to make it only parse each log file only once, and I haven't discovered any way to have it display stats for different time periods (ie. daily/weekly/monthly/quarterly/annually) all on one page. I'm not sure if these are real faults with the program, or if I just didn't figure out how to do it yet,so YMMV.
Google analytics (Score:2)
I'd put a link, but, c'mon. google.com
Re:Google analytics (Score:4, Informative)
Google Analytics is a good solution if you are looking for tracking based on javascript / web bug images.
Since they are looking for something that works off the server log files (such as LiveStats) maybe they should look at Urchin [google.com] which runs locally and processes the log files. Google purchased Urchin to make their Analytics offering. Unlike Microsoft and LiveStats, however, Google still sells and supports the Urchin software through retail partners.
Pricing is fairly reasonable, and is based on log sources and websites monitored. $895 buys you 100 profiles with one log source each. $695 extra per additional log source (i.e. if you've got 3 servers serving one website you'd need 2 additional log sources) regardless of the number of profiles. $695 extra per 100 additional profiles, as well.
They also offer campaign tracking and ecommerce reporting modules.
One thing that's impressed me about the program is the speed. We're using it on 150 profiles (with a maximum of 6 log sources per profile, though only one profile actually uses that many. Most of our profiles use only one log source) and it takes about 8 hours to process the logs each day from a central box using smb/cifs to pull the data files.
Re: (Score:1)
As a Webtrends refugee, I found Urchin to be a gift from heaven, it really rocks the house.
However, Urchin (as of ver. 5.x) does rely on Javascript and bugs.
Re: (Score:2)
Re: (Score:2)
It's also incompatible with Google's Analytics product. We regularly run the logs without the UTM (that's what they call the javascript piece) and instead setup the Google Analytics tracker instead. This lets us get historical data for sites and gives us better bandwidth usage stats for individual websites, while still allowing the end users to see the pretty graphs and awesome filtering Google gives them
Google analytics does not track me. (Score:2)
Re: (Score:2)
If most people went to the extremes you do for avoiding tracking, then yeah... I could understand the need to do the tracking locally. But most don't.
Re: (Score:2)
Anlog (Score:2, Interesting)
Re: (Score:1)
Re:Analog (Score:1)
AWstats, Google Analytics, and custom reporting (Score:3, Informative)
Google Analytics [google.com] is a little more sophisticated tool that requires you to embed a little bit of their code on every one of your pages. Also free to use.
For totally custom reporting, move your log data to the database following the guide I wrote earlier this year [kuro5hin.org].
Re: (Score:2)
Re: (Score:2)
I'll be sure to make a mental note that my stats are now 0.000125% inaccurate.
Re: (Score:2)
The percentage may not be so insignificant. For instance, the very popuplar AdBlock Plus [mozilla.org] add-on for Firefox (among the recommended add-ons [mozilla.org]) includes several blacklists to which you can subscribe. Most of these blacklists include the site google-analytics.com and patterns for blocking scripts such as urchin.js.
I guess that a fair share of Fire
Lies, damn lies, and statistics... (Score:2)
Re: (Score:2)
Re: (Score:2)
NeoThermic
Hits vs Page Views (Score:2)
Re: (Score:2)
Re: (Score:2, Informative)
Re: (Score:2, Insightful)
Webstats can be useful for showing broken links (Why so many 404s for this file? Oh crap, Sally renamed it). They can also point out commonly mising files (robots.txt, favicon.ico, sitemap.xml or whatever). Web stats can also be used for optimization -- seeing 4000 hits with only 30 visits might mean you are using way too many images. (So
Re: (Score:2)
Visits are time-limited uniques. If I read 5 stories on Slashdot within an hour, that's 5 hits and 1 visit.
Uniques are longer time-limited users. If I read 5 stories on Slashdot at 9 a.m., and 3 more at 4 p.m., that (at least in most packages) should be 8 hits, 2 visits, and 1 unique. Some packages will let you alter the timeline of Uniques, e.g., Weekly Uniques.
Ge
form the readme for Analog 6.0: How the web works (Score:2)
This section [analog.cx] is about what happens when somebody connects to your web site, and what statistics you can and can't calculate. There is a lot of confusion about this. It's not helped by statistics programs which claim to calculate things which cannot really be calculated, only estimated. The simple fact is that certain data which we would like to know and which we expect to know are simply not availabl
Re:form the readme for Analog 6.0: How the web wor (Score:1)
That whole page is well worth reading.
Many of the web stats packages other than analog really try to make you think they can get more data out than they really can.
That page and the one above it (What the results mean [analog.cx]) should be required reading for anyone about to read a web stats report. I certainly send it to all my customers whenever I set them up with a report.
CACTI (Score:1)
DJ Format & MC Abdominal would say (Score:2)
Yo, that is a lot of hits."
There's 45 hits in this song not counting the title.
http://www.youtube.com/watch?v=B38c1e52vfY [youtube.com]
Re:DJ Format & MC Abdominal would say (Score:1)
Good old Webalizer and newer stuff (Score:5, Informative)
Awstats seems to be the modern usual answer (http://awstats.sourceforge.net/ [sourceforge.net]), used and recommended by many admins and groups (in my case EGEE, European Science Grid intiative http://www.eu-egee.org/ [eu-egee.org]) but for traditionalists with no eye-candy desires, there is a copy of Webalizer (http://www.mrunix.net/webalizer/ [mrunix.net]) lurking on most servers and almost all destribution package repositories. It's worth looking at the wikipedia page for specials, extended verions and general info on web server statistics and analysis: http://en.wikipedia.org/wiki/Webalizer [wikipedia.org].
Particularly, Stone Steps Webalizer is an interesting version of feature-full and candy-enabled version: http://www.stonesteps.ca/projects/webalizer/ [stonesteps.ca]. Others can be easily found on Freshmeat: http://freshmeat.net/search/?q=webalizer§ion=p rojects [freshmeat.net] (i.e. Webalizer Extended with included Geolizer and extensive 404 analysis support, http://www.patrickfrei.ch/webalizer/ [patrickfrei.ch] and AwFull with usability, CSS and geo-ip features, http://www.stedee.id.au/awffull [stedee.id.au] etc.).
Others can be found on Freshmeat (117 hits at this time http://freshmeat.net/search/?q=web&trove_cat_id=24 5§ion=trove_cat [freshmeat.net]) and Wikipedia (very short and poor stub of a list that you might want to improve after your extensive testing :-) : http://en.wikipedia.org/wiki/Category:Free_web_ana lytics_software [wikipedia.org].
There is also Sherlog, an Apache Log Analyser, specialized in user experinece tracking more than statistcs - an interesting complimentary tool (http://sherlog.europeanservers.net/ [europeanservers.net].
Omniture, if you can afford it (Score:1)
Don't have to deal with web logs, always updated in real time, AMAZING functionality. Just pricey. our company found most of the open source or cheaper ones to be a bit lacking in functionality...just depends on your needs.
Re: (Score:1)
Re: (Score:1)
Re: (Score:1)
My own (Score:2)
obvious to me that any of the common ones supported aggregating stats across domains / hosts. Eg, I have 10 virtual servers on this
Apache [apache.org] box, give me a sorted list of hits per domain/host. Probably one or more of the popular open-source stats packages [wikipedia.org]
*does* do this, but I didn't feel like spending hours examining different ones and installing them. Since my needs were very basic
I just wrot
Next time you'll know (Score:2)
With a proprietary solution, the customers are the ones who support the product, and are then shafted when the product is discontinued.
TraceWatch if you want a really good one (Score:1)
Sawmill is nice (Score:1)
WebTrends 8.0 (Score:1)
Problem (Score:2)
What problem are you trying to solve?
This appears to me to be another project looking for a problem to solve. This is way too common in IT. Application cruft on the PC occupying memory, and generally bogging things down. Then the users complain that the network is too slow. Then you have to buy new hardware, and the new hardware needs new cruft... Lather, Rinse, Repeat.
Of course, this could also be a "Review Fodder" project with the goal of adding a new line to your self-assessment paper to the boss.
answer from an ex-DeepMetrix employee (Score:1)
Mint (Score:2)
Easy (Score:1)