Thursday, November 10, 2011

How to determine two performance runs are statistically different?

Performance tests are good examples of Normal Distribution. A normal distribution of data means that most of the examples in a set of data are close to the "average," while relatively few examples tend to one extreme or the other.

According to statisticians, Two tests are considered statistically different if it is unlikely to have occurred by chance. we can be 99% sure that averages from two runs are really different only when the other average is more than 2.57 standard deviations away.


The same principle can also be applied to performance tests to determine results obtained are statiscally different.


For example, If the Average response time for transaction A is 1 sec during the first run with the standard deviation of 0.2 and for the second run average response time for the same transaction A is 1.2 sec. The difference between the two transaction response time is 1.2 – 1.0 =0.2, which is 1 standard deviations away (Standard deviation observed during the first run is 0.2) on the average. If the difference is 2.57 standard deviations away(we are 99% sure) then the results are statistically different.

 
Conversely we can even calculate the statistical limits using the formula below
If R1 is average response time and SD is the standard deviation for a performance run then the average response time for the second run should not exceed R1 + (SD* 2.33) or should not be less than R1 - (SD* 2.33). For the above example response time for the second run should not exceed 1 + (0.2* 2.33) =1.466 on the positive side of the average and 1 –(0.2*2.33) = 0.534 on the negative side of the average.

Working with LoadRunner Sybase Ctlib protocol

The goal of this document is to bring together necessary information to help those users who are involved in scripting Client Server Database protocols using LoadRunner.


The document can be used as a basis for scripting, enhancing and debugging any of the following protocols

Ø Sybase CTlib
Ø Sybase DTlib
Ø Informix
Ø MS SQL Server
Ø Oracle 2-Tier
Ø ODBC
Ø DB2 CLI
Ø ERP/CRM Siebel Vuser scripts


This document is based on my experiences using LoadRunner with Sybase Ctlib protocol

Click here to download the document

End to End Performance Test Approach - Part 2

During the test planning activities, gather the performance test objectives, calculate resource estimations and project timelines, and review the Architecture with individual team members and determine the types of tests required to test the application.

Capture the project specific information for each of the projects in the test plan as per the following template. This information will help us to co related the performance test results with system configurations. Performance test results will vary with system configuration changes.


Project Name
XYZ
Application Background
XYZ is an online web based application which is used by the customer to place orders related to various products. System currently getting upgraded from oracle 10g to oracle 11i
Type of Project:
Seibel Web application/ SAP GUI
Application Technology
dot Net, IIS web server, C #, VB
Hardware Platform involved and OS
App Server - windows XP, 2 GB ram, 4 cpu
DB server - Windows XP SP2, 3 GB, 10 CPU
Database
Oracle
Third party tools



Create a workload model which covers the list of scenarios identified for the performance testing along the SLA’s and the user load. No of Txns and No of Concurrent Users will be derived from the volumetric analysis


S. No.
Transaction/Script
Online/Batch
No. of Concurrent Users
Response time
No. of
Txns.
1
Scenario 1
O
9
< 10 secs
12
2
Scenario 2
O
4
< 1 secs 
12
3
Scenario 3
O
15
< 2 secs 
143
4
Scenario 4
O
4
< 13 secs
20
5
Scenario 5
O
2
< 4 secs 
20
6
Scenario 6
B
3
< 5 secs 
3
7
Scenario 7
B/O
1
< 2 secs 
4
8
Scenario 8
O
1
< 4 secs 
5


Identify the different types of tests required for testing the application based on the requirement analysis.
In Load test,  measure server response times to verify if the application can sustain expected maximum number of concurrent users and expected maximum size of the database.
In Stress test, measure server response times at varying loads starting from low load (low number of concurrent users), medium load (average number of concurrent users) through high load (expected maximum number of concurrent users until unacceptable levels of response times are experienced) to validate application's stability and validity.
In Endurance test, test the application for longer durations with half the average system load to detect the possible memory leaks in the system.
A detailed test plan canl be laid out using the information captured during the requirements gathering phase and share it with the development/Business team and take their inputs for the final approval. Test plan should include the following (but not limited to):
  • Scope
  • Test Approach
  • Test Objectives
  • Test Environment setup and requirements
  • Types of tests
  • Transaction mix
  • Workload Scenario
  • Identify Monitors
  • Scheduling ( Testing sequence , Test cycles)
  • Data setup (Data required by the Test tool, not the Application data)


Test Design & Execution
During the test design phase, validate the existing scripts and develop new functionalities based on the workload model and also validate and update the data required for the test environment identified during the test plan and also analyze script failures with the intent of finding their root cause so that we can debug our scripts effectively. We should also collect any application related errors found during the script validations and share with the development team.
During the execution process, first try to validate the scripts are pointing to the correct environment and performance metrics to be captured are properly configured in the environment. We should also validate load generator machines are up and working fine.
Each script should be run individually several times to validate that the script has been developed correctly. These tests may reveal performance problems that will need to be addressed.
Mixed load test can be carried out for the identified scenarios consisting of all transactions, online and batch, according to the workload mix discussed earlier. The load tests have to be run multiple times to ensure that the testing process is repeatable and also configure all the performance metrics in the load testing tool prior to the start of the test.

Result Analysis and Reporting
 Focus on analysis, monitoring, identifying bottle necks and proving recommendations, thus providing an end-to-end performance solution for the complete application.
Send the test reports   from various tests results with  conclusions based on those results, and also with consolidated data that supports those conclusions. And also do analysis, comparisons, and details of how the results were obtained.
At the end of each run of the Performance Test, a report should be produced. The test report should have comprehensive data collected from various sources presented in a single document.
For each of the test cases, the following response times should be reported: arithmetic mean, standard deviation, 90th percentile response time and other percentiles as necessary. In addition, each test case also report the total number of transactions executed, the time period over which the transactions were executed, number of errors and number of retries.

Collect comprehensive set of system data and tabulated in the test report for each run. The data that will be collected will include CPU utilization, memory utilization – system wide and per process, DB statistics.

End to End Performance Test Approach - Part 1

The purpose of this post is to show end to end approach for implementing performance testing. This document covers different phases of performance testing and approach to follow for successful performance test

Requirement Gathering:


During the Requirement Analysis phase it is important to assess and understand the nature of the application and the environment in which testing and monitoring should be performed. In addition to this, identify the resource requirements and plan based on the application. Also the existing Non-functional Requirement (NFR) would be discussed and understood by having discussion with the Design & Development team with respect to SLA’s, number of concurrent users and volumetric information. Also, wherever there is any specific information lacking, the same should be discussed with Design & Development team to reach a mutual agreement and definition of the same.

Performance team should also analyze system volume metrics over specified period of time in production to identify load patterns, work load behavior, Peak user load etc in the system.


Identify the peak period during volume metric analysis and also transaction arrival rate along with the user concurrency to simulate in the test environment. Create a workload model for the peak load with transaction mix along the user load based on the volume metric analysis. This will also help us in identifying the types of tests required for testing the application. Below is a sample template of the workload model for deriving the peak load.


Workload Model

Setting LoadRunner Header File Path

LoadRunner automatically compiles all the header files present in LoadRunner\include directory.


But it required header files need to be updated regularly after changes into the LoadRunner\include directory.



You can change the load runner properties to include your own header files folder

Please follow the step below one by one:

1. Browse through C:\Program Files\Mercury\LoadRunner\dat folder and search mdrv.dat file

2. Create a back up of the file in the same directory before making changes

3. Open the mdrv.dat file and search for [lrun_api] text in the file

4. You will reach the below section

[lrun_api]


ExtPriorityType=internal


WINNT_EXT_LIBS=lrun50.dll


WIN95_EXT_LIBS=lrun50.dll


LINUX_EXT_LIBS=libLrun50.so


SOLARIS_EXT_LIBS=libLrun50.so


HPUX_EXT_LIBS=libLrun50.sl


AIX_EXT_LIBS=libLrun50.so


LibCfgFunc=LrunApi_configure


UtilityExt=ParamEngine,Transaction,vusr_log,faserver,run_time_context


ExtIncludeFiles=lrun.h


ActiveScriptItems=Message:Mercury.Lrvb.LrMessage.1,Timing:Mercury.Lrvb.LrTiming2.1,Transaction:Mercury.Lrvb.LrTransaction2.1


ExtMessageQueue=0


SecurityRequirementsFiles=AllowedFunctions.asl


SecurityMode=On


5. Now add a new line to this section as follow


[lrun_api]


ExtPriorityType=internal


WINNT_EXT_LIBS=lrun50.dll


WIN95_EXT_LIBS=lrun50.dll


LINUX_EXT_LIBS=libLrun50.so


SOLARIS_EXT_LIBS=libLrun50.so


HPUX_EXT_LIBS=libLrun50.sl


AIX_EXT_LIBS=libLrun50.so


LibCfgFunc=LrunApi_configure


UtilityExt=ParamEngine,Transaction,vusr_log,faserver,run_time_context


ExtIncludeFiles=lrun.h


ActiveScriptItems=Message:Mercury.Lrvb.LrMessage.1,Timing:Mercury.Lrvb.LrTiming2.1,Transaction:Mercury.Lrvb.LrTransaction2.1


ExtCmdLine=-compile_flags C:\testscripts\ExternalHeaderFilePath


ExtMessageQueue=0


SecurityRequirementsFiles=AllowedFunctions.asl


SecurityMode=On

6. After making the changes, save the mdrv.dat file.

7. Take a sample script and execute the test once. 

How to convert LoadRunner Siebel-Web Protocol scripts to Web Protocol

1. Open the LoadRunner .usr file in Notepad

2. Check the following lines in the notepad

[General]


Type=Multi


AdditionalTypes=Siebel_Web


ActiveTypes=Siebel_Web


GenerateTypes=Siebel_Web




3. Replace Siebel_web with QTWeb

[General]


Type=Multi


AdditionalTypes=QTWeb


ActiveTypes=QTWeb


GenerateTypes=QTWeb


RecordedProtocols=QTWeb



Controller no longer requires separate license for Siebel-Web. Web user license is sufficient to execute the scripts

Performance Assessment Methodology

The below approach will help you if  you are planning to act as a consultant for a performance testing engagement and provide recomendations to your customer.

The attached diagram will detail  the assessment approach for the following topics

  • Study the existing test processes
  • Study Architecture
  • Gather Business Information
  • Define test Model
  • Define tests 

Performance Assessment Methodology