Using FusionReactor’s Metric and Log Archive Viewer for post-crash troubleshooting

A customer recently wrote in to the FusionReactor Support Team regarding their ColdFusion server, which was crashing on average once per week. We used FusionReactor, and specifically the Archive Viewer, to take the customer’s archived log data and diagnose the root cause of the issue.

This blog explains how the root cause of the crash was diagnosed with the Archive Viewer, a feature introduced in FusionReactor 7.2.0 that allows users to display and graph historical log data in much the same way as viewing live or recent data from a monitored application server.

Starting with the resource log, it was possible to see that the total memory allocated by the JVM was exceeding the maximum memory configured for the ColdFusion server:

Resource.log

From there it was clear that there was an issue with memory usage on the server. Looking at the memory summary logs, it was possible to see that actual heap usage was only 20% of the usable memory; however, the committed memory was high.

Memory Summary Usage

The fact that heap usage was only 20% while committed memory was high suggests that a particular memory space was attempting to use more memory than it had available. At this point it was possible to look at the logs for each available memory space. From this, it was possible to see that the Old Gen memory space had committed 14 GB of its available space:

OldGen Memory Usage
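The used-versus-committed comparison that the Archive Viewer graphs from its logs can also be pulled directly out of a running JVM via the standard `java.lang.management` API. The sketch below is not part of FusionReactor itself, just an illustration of where these numbers come from; note that the Old Gen pool name varies by collector (for example “PS Old Gen” under the parallel collector, “G1 Old Gen” under G1).

```java
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryPoolMXBean;
import java.lang.management.MemoryUsage;

public class PoolUsage {
    public static void main(String[] args) {
        // Iterate over all memory pools (Eden, Survivor, Old Gen, Metaspace, ...)
        for (MemoryPoolMXBean pool : ManagementFactory.getMemoryPoolMXBeans()) {
            MemoryUsage u = pool.getUsage();
            // Committed memory is what the OS has actually handed to the JVM.
            // A large gap between "used" and "committed" is exactly the
            // pattern the memory summary log showed in this case.
            long maxMb = u.getMax() < 0 ? -1 : u.getMax() / (1024 * 1024);
            System.out.printf("%-25s used=%dMB committed=%dMB max=%dMB%n",
                    pool.getName(),
                    u.getUsed() / (1024 * 1024),
                    u.getCommitted() / (1024 * 1024),
                    maxMb);
        }
    }
}
```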

The Old Gen memory space is used for long-lived objects that remain in the JVM for longer periods of time. This includes objects such as the ColdFusion scopes and the classes loaded into memory, and the space is only cleared when major garbage collections occur within the JVM.

In a typical ColdFusion server, you will see considerably more minor garbage collections than major garbage collections. However, using the gc (garbage collection) logs it was possible to see that major collections were occurring on average every 4 minutes, each taking between 750 ms and 1.5 seconds:

GC Marksweep times
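The same collection counts and cumulative pause times are exposed by the JVM itself, which is one way to sanity-check what the gc logs show. Again this is an illustrative sketch rather than FusionReactor’s own mechanism; the collector bean names vary by GC algorithm (for example “PS MarkSweep” for the parallel old-generation collector, “G1 Old Generation” under G1).

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;

public class GcStats {
    public static void main(String[] args) {
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            long count = gc.getCollectionCount(); // -1 if undefined for this bean
            long timeMs = gc.getCollectionTime();
            // Average pause per collection; on the customer's server the
            // major collector was averaging 750 ms - 1.5 s every ~4 minutes.
            double avgMs = count > 0 ? (double) timeMs / count : 0.0;
            System.out.printf("%-25s collections=%d totalTime=%dms avg=%.1fms%n",
                    gc.getName(), count, timeMs, avgMs);
        }
    }
}
```

Outside of FusionReactor, the underlying data can also be captured natively with `-Xlog:gc` (JDK 9+) or `-XX:+PrintGCDetails` (JDK 8).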

Time spent on major garbage collections in Java consumes large amounts of CPU and reduces the performance of the application server.

Using the classes log file, it was then possible to see that a very large number of classes were loaded into the JVM, which would explain why 14 GB of Old Gen memory was committed and why the JVM was no longer able to allocate the memory required to run the application:

Count of loaded classes
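The class counts graphed above have a direct JVM-level counterpart in the `ClassLoadingMXBean`. A minimal sketch, purely to show what the numbers mean (not FusionReactor’s implementation):

```java
import java.lang.management.ClassLoadingMXBean;
import java.lang.management.ManagementFactory;

public class ClassCount {
    public static void main(String[] args) {
        ClassLoadingMXBean cl = ManagementFactory.getClassLoadingMXBean();
        // On a healthy server the currently-loaded count stays flat over time.
        // A totalLoaded figure that keeps climbing, with matching unloads,
        // suggests classes are being repeatedly discarded and reloaded.
        System.out.printf("loaded=%d totalLoaded=%d unloaded=%d%n",
                cl.getLoadedClassCount(),
                cl.getTotalLoadedClassCount(),
                cl.getUnloadedClassCount());
    }
}
```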

It is important to note that the high number of loaded classes is not the only cause of the application server running out of Old Gen memory, but it is certainly a factor. In the case of this ColdFusion server, the Old Gen will contain the session scopes, user session tracking data, all loaded classes, and many other objects that need to exist for a longer period of time.

However, the fact that classes are being loaded periodically in this manner is highly irregular and would be cause for concern. A similar ColdFusion server in a healthy state would show something like this:

Example of the count of loaded classes for a similar ColdFusion server that is in a healthy state

From this it is possible to see that the class count is much lower and, most importantly, the graph is flat. This is what I would expect: classes committed to the Old Gen space should not be cleared and reloaded into memory this often, and should only be unloaded if Java decides the loaded classes are stale and their bytecode may have changed.

From this information, it was possible to recommend increasing the maximum heap space available to the JVM, and potentially the resources available to the Old Gen space. Trimming down the running CFML applications would also be a possible way to reduce the Old Gen and overall heap usage of the JVM.
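As a concrete illustration, ColdFusion’s heap settings live in its jvm.config file. The values below are hypothetical, not a sizing recommendation for this customer; the right figures depend on the physical RAM available on the server:

```
# cfusion/bin/jvm.config (values are illustrative only)
# -Xmx raises the maximum heap; -Xms sets the starting heap.
# -XX:NewRatio=2 gives the Old Gen roughly two thirds of the heap.
java.args=-Xms8g -Xmx16g -XX:NewRatio=2
```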
