Diagnose Network Problems with Integrated Network Visibility

November 07 2017
 

Read more about how AppDynamics Integrated Network Visibility can help facilitate collaboration between teams and bring down mean time to repair (MTTR).


More and more distributed apps are being deployed in the private, hybrid, and public clouds, and the performance of these apps is becoming increasingly critical for enterprises.

In fact, the AppDynamics 2017 App Attention Index highlights the modern day consumer demand for speed and consistency, with 62 percent of respondents expressing increased expectations for how well digital services should perform. What’s more, when apps don’t perform correctly, 80 percent of users will delete the app. Needless to say, the bar for application performance is extremely high.

AppDynamics APM is well-equipped to monitor the performance of these apps, pinpointing app flows that degrade the end-user experience through the lens of the Business Transaction (BT). However, operations teams triaging problems are always challenged with the question of whether the underlying network is the cause of the degradation.

Enterprises typically have dedicated teams to manage the infrastructure (including network) and apps, but these teams don’t necessarily speak the same language, thus creating a communication barrier. AppDynamics Integrated Network Visibility attempts to facilitate collaboration between teams and bring down mean time to repair (MTTR). It’s a solution that is designed to enable AppOps to identify network-level problems during the “First Call” and escalate it to the right network team with actionable information. It also seamlessly integrates with application flow maps and directly correlates network performance metrics with application performance metrics, all within the context of business transactions.

Dynamic Dashboard for Network Visibility

One of the standout features of Network Visibility is the Dynamic Dashboard – a set of widgets showcasing trends of Transmission Control Protocol (TCP) connection metrics and host-level TCP socket metrics for selected time ranges. It also includes native metrics like Throughput, Loss, Data and SACK Retransmissions, TCP Resets, Connection Information, and uber metrics (single representation of a bunch of related metrics) like Network Errors and Performance Impacting Events (PIE). For example:

  • Network Errors bundles FIN Errors, Syn Black-holes, Syn Resets and RST on Established which captures errors that can occur on init or teardown of TCP connections.

  • PIE coalesces Client Zero Window, Client Limited, RTOs, Server Zero Window,  and Server Limited which help highlight symptoms of a problem on the client node, server node, or the path between them. Full list of dashboard metrics can be found here.

With this data, you can now identify the contribution of the underlying network infrastructure. For example, consider a stalled transaction on your application flow map. With Network Visibility, users can launch this dashboard for the affected Tier / Node / Link and gain insightful network information, including:

  • A spike in the Latency trend, which could indicate a sluggish TCP connection between two services.
  • An uptick in Retransmissions, which could indicate network congestion.
  • High values of Client / Server Limited, Client / Server Zero Window or PIE, which could imply inadequate TCP window sizing (Back Pressure) and a need for TCP optimization.
  • “Network Impact on Transactions” juxtaposes PIE and Network Errors against Transactions, so network contribution for afflicted transactions can be identified.
  • Network Errors and Connection Information widgets, which help identify issues with TCP connections and their lifetimes.
  • Host Stack KPIs widget, which has metrics like Interface collisions & Wait Sockets which can help unearth issues in NIC or Duplex configurations.
  • Throughput, Loss and Latency widgets, which highlight the network health of the selected entity.

 DD-Launch.gif

Snapshot Correlation

As the name implies, Transaction Snapshots is a popular feature in which AppDynamics retains a snapshot of certain transaction instances. This could be triggered by an automatic detection of slow transactions or a user-driven diagnostic session. A transaction snapshot gives you a cross-tier view of the processing flow for that particular transaction.

Transaction Snapshot drill downs will come with a network tab for the dynamic dashboard which will allow you to correlate network metrics captured at the time of snapshot collection. Each chart has the snapshot time range highlighted. You can then look for correlations in these charts and drill down to the root cause.

DD -SnapShot.gif

With integrated network visibility now running alongside the APM metrics you rely on to run your business critical applications, you can easily switch to a view of critical network performance indicators for your tiers, nodes and the flows between them.

Learn more about network visibility or start a free trial today.

Vinay Gangoli
Vinay Gangoli is a UI Engineer at AppDynamics. He has a M.S in Computer Science and has been working on interactive web-based applications for over 13 years. He is currently building user-friendly data visualizations at AppDynamics to meet the user demand for paradoxically simpler UX and faster load times. Outside of work, he likes playing tennis and seeing the world through his 4-year-old daughter's eyes.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form