Print Page - Measuring failover times

Title: Measuring failover times
Post by: SimonV on January 24, 2018, 03:17:33 PM

I'm wondering if there is any tool that will accurately measure failover times in the lab, possibly even down to the milli-second. Anything that's more accurate than a ping would be great... :)

Title: Re: Measuring failover times
Post by: deanwebb on January 24, 2018, 05:17:22 PM

What about debug logging? Or are their timestamps not specific down to the millisecond? Would syslogs be the answer?

Title: Re: Measuring failover times
Post by: icecream-guy on January 24, 2018, 09:03:09 PM

what constitutes failover time? there lies the problem, when the packets make it to the redundant server? or when the service is responding

Title: Re: Measuring failover times
Post by: SimonV on January 25, 2018, 02:35:42 AM

Quote from: ristau5741 on January 24, 2018, 09:03:09 PM
what constitutes failover time? there lies the problem, when the packets make it to the redundant server? or when the service is responding

When full end-to-end connectivity has been restored.

Something like iperf, with two nodes on both ends of the topology that you're testing, keeping a steady stream of timestamped traffic open and reporting back when exactly end-to-end connectivity has been restored. Thinking about it, might be possible using BFD for this.

Title: Re: Measuring failover times
Post by: wintermute000 on January 25, 2018, 06:11:30 AM

IXIA, spirent STC (L4) / Avalanche (L7) are two big name commercial traffic generator options.

Here's an open source one: https://trex-tgn.cisco.com/ (https://trex-tgn.cisco.com/)

This is a VERY deep field once you get into the weeds... be warned. The exact nature of the traffic will have a bearing on what you measure and what you're actually making the device do - this is especially true for security platforms. Example: I'm driving 2k concurrent TCP connections (RENO) @ ~70k connections with total throughput ~4Gb (lots of tiny 8kb HTTP GETs). It tells me I lost 15000 packets and 500 connections. What's the convergence time? LOLOLOLOLOL

For simple RS something like a spirent which generates a simple stream of packets, count the lost packets that's done - sure, but as soon as you get into stateful devices or real world traffic patterns hmmmm

Title: Re: Measuring failover times
Post by: dlots on January 25, 2018, 07:51:21 AM

I have used Jperf/wireshark for this. Just send a stream of UDP traffic as fast as possible with a destination of a PC, the PC captures on wireshark. Cause failure, once the fail-over is done stop the capture. Filter so that only traffic from the Jperf box is visible.

Go to view, time display format, seconds since previous displayed packet. Then sort by time. The largest amount of time there should be your fail-over time.

Networking-Forums.com

Professional Discussions => Routing and Switching => Topic started by: SimonV on January 24, 2018, 03:17:33 PM