XFire helps pinpoint Web application bugs
Posted: 08.02.04Modern Web applications are a conglomerate of many individual moving parts. Web server software and hardware, application servers, database systems, custom developed applications, a variety of network devices, the network transport and, of course, the ever-present Web browser all present opportunity for failure. Unfortunately, when a Web application fails, finding out exactly what caused the bug can be a peskier task than fixing the bug itself.
In our test, Xaffire's xFire 3.0 platform attempts to provide a holistic view of a Web application's health and usage to assist in bug isolation. The product mostly succeeds in this task by tying together a range of data, although a few rough edges in its interface and overall approach hold it back at times.
To properly debug any software application - Web-based or not - the problem must be reproduced. Yet when it comes to Web applications, simple log files often don't provide enough information, as you need to know what the user did and the context in which they did it. You need to know not only the series of requests that might have led up to a problem, but also any data a user might have entered, the type of browser they used, and the states of the server and the network at the time of the request.
XFire attempts to provide an instant replay of a Web application problem by capturing and correlating data at the network level and the Web server level. Some data is fairly easy to collect from various logs and network-monitoring interfaces, but to collect actual user session data, xFire needs an interceptor in the form of an Internet Server API filter for an Internet Information Server (IIS) or an Apache module for an Apache-based system. We ran our IIS testing on an IIS 5.0-based system, as the collection filter has not yet been ported to IIS 6.0. Xaffire officials indicated the IIS port is well underway.
One caveat with the filter/module approach for collecting user data is that it will induce some minor performance degradation on a server, particularly if it is under significant load. That seems to be a minor penalty given the payoff of being able to actively debug a live application.
XFire also collects other infrastructure-related data, such as the health of a Web server, application server, database system or an SNMP-capable network device. All the data from monitored sessions and systems is collected at an xFire archiver appliance for monitoring and later analysis. The archiver appliance has a management console that provides access to the three major aspects of the xFire approach: infrastructure monitoring, synthetic transaction deployment and monitoring, and live session capture and playback.
The infrastructure monitor provides a simple graphical overview of the various resources that might be found in a Web farm. Some monitors are very basic ping or SNMP monitors, while in other cases with databases such as MySQL or application servers such as Tomcat, more detailed measurements are taken.
Discovery and configuration of these monitors is generally straightforward, but be forewarned given the range of versions available - particularly for open source platforms - you might have to tinker with settings, as we discovered when one of our versions of MySQL couldn't be monitored immediately.
While the infrastructure-monitoring aspect of Xaffire's offering certainly is not going to compete with a more network-operations-focused platform such as Computer Associates' Unicenter, it provides reasonable system availability indicators for those interested in application-level activity. These indicators are helpful, because in most companies, the person debugging a Web application is often not always the person who can access an enterprise network management platform.
The synthetic-monitoring feature lets network managers create test scripts to be played at various intervals to exercise a site's functionality. A service can be employed in the offering so that recorded transactions can be played not only from the local environment, but also from remote beacons on the Internet.
While the synthetic- and remote-monitoring aspect of the product doesn't represent actual user traffic, it is very appropriate. The health of the synthetic transactions provides insight into whether the network might be part of the problem the user is encountering. Like the infrastructure component, the synthetic transactions with local and remote monitoring xFire provided easily can be bested by specific services such as Keynote or Gomez, but xFire's direct integration with its other reporting services makes the feature very useful.
The product shines in its user session capture and playback capabilities. XFire provides a wealth of useful information showing full request and response headers and data. Once a particular session is found by searching by time or hit - which can be difficult if you don't have an exact time or a unique indication such as a logon value - you can replay the session to observe very much like the user did.
Unfortunately, playback is limited to an Internet Explorer-based environment and really doesn't address any form of browser-specific, client-side problems you might encounter. This could be a significant problem for a site that relies significantly on client-side technology such as JavaScript, which might act very differently depending on the browser. Given the likelihood of potential client-side problems, particularly the rise of richer Internet-based applications, future Web application debugging environments such as xFire should add client-side debugging features to the mix.
While the playback obviously is not an exact duplicate of what a user might see, it is close enough to be useful for a majority of Web applications.
During testing we used the product in a live scenario and determined that a reported instability of a monitored e-commerce site was not related to user activity, but instead unstable software. The problem wasn't solved immediately, but the culprit was determined primarily by using xFire in conjunction with some log file pattern analysis. This made problem identification a few hours' effort vs. the often few days' effort.
While xFire proved itself in real use, a significant downside of the xFire system is that the interface is just plain unintuitive in places and often too cluttered. While it might be good enough to solve a problem, the console would benefit significantly from usability testing and user interface re-factoring.
On the bright side, the TiVo-like interface that let us see user data at a particular time was appealing. Although we wonder if a high-traffic Web site would limit its usefulness because it would be too difficult to find a particularly interesting session among the plethora of captured data.
While the xFire platform has rough edges, its integration of many disparate pieces of Web site user, system and application data shows that the whole is often greater than the sum of its parts. Together, this information could be used to diagnose serious Web application problems that users encounter.
NetResults
xFire 3.0
Xaffire
(303) 642-4477
www.xaffire.com (Outdated)
Price: Starting at $25,000 for two Web servers.
Overall Rating: 3.35
Pros: Powerful combination of disparate data sources; clear focus on problem resolution; integration with synthetic transactions and monitoring.
Cons: No IIS6 support; unintuitive user interface; restricted to filter-based approach for session capture.
How we did it
We tested the xFire platform in a mixed IIS and Apache environment with a variety of middleware environments, including Tomcat, ColdFusion, Active Server Pages and ASP.NET and databases including mySQL and Microsoft SQL Server. The bulk of the testing was performed on an internal IIS5 platform running numerous Web sites and an externally hosted IIS5 served e-commerce site developed in class Active Server Pages using numerous Internet Server API filters.
Originally published on Network World, Published: August 2, 2004.