ACM Applicative 2016: System Methodology
Video: https://youtu.be/eO94l0aGLCA?t=3m37s
System Methodology: Holistic Performance Analysis on Modern Systems
Description: "Traditional systems performance engineering makes do with vendor-supplied metrics, often involving interpretation and inference, and with numerous blind spots. Much in the field of systems performance is still living in the past: documentation, procedures, and analysis GUIs built upon the same old metrics. For modern systems, we can choose the metrics, and can choose ones we need to support new holistic performance analysis methodologies. These methodologies provide faster, more accurate, and more complete analysis, and can provide a starting point for unfamiliar systems.
Methodologies are especially helpful for modern applications and their workloads, which can pose extremely complex problems with no obvious starting point. There are also continuous deployment environments such as the Netflix cloud, where these problems must be solved in shorter time frames. Fortunately, with advances in system observability and tracers, we have virtually endless custom metrics to aid performance analysis. The problem becomes which metrics to use, and how to navigate them quickly to locate the root cause of problems.
System methodologies provide a starting point for analysis, as well as guidance for quickly moving through the metrics to root cause. They also pose questions that the existing metrics may not yet answer, which may be critical in solving the toughest problems. System methodologies include the USE method, workload characterization, drill-down analysis, off-CPU analysis, and more.
This talk will discuss various system performance issues, and the methodologies, tools, and processes used to solve them. The focus is on single systems (any operating system), including single cloud instances, and quickly locating performance issues or exonerating the system. Many methodologies will be discussed, along with recommendations for their implementation, which may be as documented checklists of tools, or custom dashboards of supporting metrics. In general, you will learn to think differently about your systems, and how to ask better questions."
PDF: ACMApplicative2016_SystemMethodology.pdf
Slide text (extracted with pdftotext):
slide 1:
ACM Applicative 2016, Jun 2016
System Methodology: Holistic Performance Analysis on Modern Systems
Brendan Gregg, Senior Performance Architect
slide 2:
Apollo LMGC performance analysis: CORE SET AREA, VAC SETS, ERASABLE MEMORY, FIXED MEMORY
slide 4:
Background
slide 6:
History
• System Performance Analysis up to the '90s:
  – Closed source UNIXes and applications
  – Vendor-created metrics and performance tools
  – Users interpret given metrics
• Problems:
  – Vendors may not provide the best metrics
  – Often had to infer, rather than measure
  – Given metrics, what do we do with them?
(Slide shows example "ps alx" output with columns F S UID PID PPID CPU PRI NICE ADDR WCHAN TTY TIME CMD, for swapper, /etc/init, and -sh.)
slide 7:
Today
1. Open source: operating systems (Linux, BSDs, illumos, etc.); applications (source online, e.g. GitHub)
2. Custom metrics: can patch the open source, or use dynamic tracing (open source helps)
3. Methodologies: start with the questions, then make metrics to answer them; methodologies can pose the questions
The biggest problem with dynamic tracing has been what to do with it. Methodologies guide your usage.
slide 8:
Crystal Ball Thinking
slide 9:
Anti-Methodologies
slide 10:
Street Light Anti-Method
1. Pick observability tools that are: familiar, found on the Internet, or found at random
2. Run tools
3. Look for obvious issues
slide 11:
Drunk Man Anti-Method
• Drink
• Tune things at random until the problem goes away
slide 12:
Blame Someone Else Anti-Method
1. Find a system or environment component you are not responsible for
2. Hypothesize that the issue is with that component
3. Redirect the issue to the responsible team
4. When proven wrong, go to 1
slide 13:
Traffic Light Anti-Method
1. Turn all metrics into traffic lights
2. Open dashboard
3. Everything green? No worries, mate.
• Type I errors: red instead of green – team wastes time
• Type II errors: green instead of red – performance issues undiagnosed; team wastes more time looking elsewhere
Traffic lights are suitable for objective metrics (e.g. errors), not subjective metrics (e.g. IOPS, latency).
slide 14:
Methodologies
slide 15:
Performance Methodologies
• For system engineers: ways to analyze unfamiliar systems and applications
• For app developers: guidance for metric and dashboard design
Collect your own toolbox of methodologies. System methodologies include: problem statement method, functional diagram method, workload analysis, workload characterization, resource analysis, USE method, thread state analysis, on-CPU analysis, CPU flame graph analysis, off-CPU analysis, latency correlations, checklists, static performance tuning, and tools-based methods.
slide 16:
Problem Statement Method
1. What makes you think there is a performance problem?
2. Has this system ever performed well?
3. What has changed recently? Software? Hardware? Load?
4. Can the problem be described in terms of latency or run time? (not IOPS or throughput)
5. Does the problem affect other people or applications?
6. What is the environment? Software, hardware, instance types? Versions? Config?
slide 17:
Functional Diagram Method
1. Draw the functional diagram
2. Trace all components in the data path
3. For each component, check performance
Breaks up a bigger problem into smaller, relevant parts. E.g., imagine throughput between the UCSB 360 and the UTAH PDP10 was slow… (ARPA Network, 1969)
slide 18:
Workload Analysis
• Begin with application metrics & context; a drill-down methodology
• Pros: proportional, accurate metrics; app context
• Cons: app specific; difficult to dig from app to resource
(Diagram: Workload → Application → System Libraries → System Calls → Kernel → Hardware, with analysis working down from the workload.)
slide 19:
Workload Characterization
• Check the workload: who, why, what, how – not the resulting performance
• E.g., for CPUs – Who: which PIDs, programs, users. Why: code paths, context. What: CPU instructions, cycles. How: changing over time.
slide 20:
Workload Characterization: CPUs
(Quadrant diagram: Who → top; Why → CPU sample flame graphs; What → PMCs; How → monitoring. A command-line sketch follows below.)
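As a rough illustration (not from the slides): on a Linux system with sysstat and perf installed, the four quadrants might be answered with commands like these; durations are arbitrary.

  # Who: which PIDs, programs, users are on-CPU
  top -b -n 1 | head -20
  pidstat 1 5

  # Why: code paths, via 99 Hz stack sampling (render as a CPU flame graph)
  perf record -F 99 -a -g -- sleep 30

  # What: cycles, instructions, and cache activity via PMCs
  perf stat -a -d sleep 10

  # How: changing over time
  sar -u 1 60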
slide 21:
Resource Analysis
• Typical approach for system performance analysis: begin with system tools & metrics
• Pros: generic; aids resource perf tuning
• Cons: uneven coverage; false positives
(Diagram: same Workload → Application → System Libraries → System Calls → Kernel → Hardware stack, with analysis working up from the resources.)
slide 22:
The USE Method
• For every resource, check:
  – Utilization: busy time
  – Saturation: queue length or time
  – Errors: easy to interpret (objective)
Starts with the questions, then finds the tools. E.g., for hardware, check every resource, including busses.
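A minimal Linux checklist sketch of the idea, assuming the sysstat tools are installed; column names and sensible thresholds vary by version and workload.

  # CPU
  mpstat -P ALL 1 5     # utilization per CPU
  vmstat 1 5            # saturation: "r" (run queue) greater than CPU count

  # Memory
  free -m               # utilization
  vmstat 1 5            # saturation: si/so (swapping) nonzero

  # Disks
  iostat -xz 1 5        # utilization (%util), saturation (queue size, await)

  # Network
  sar -n DEV 1 5        # utilization: throughput vs interface line rate
  sar -n EDEV 1 5       # errors and drops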
slide 23:
http://www.brendangregg.com/USEmethod/use-rosetta.html
slide 24:
Apollo Guidance Computer: CORE SET AREA, VAC SETS, ERASABLE MEMORY, FIXED MEMORY
slide 26:
USE Method: Software
• The USE method can also work for software resources – kernel or app internals, cloud environments; small scale (e.g. locks) to large scale (apps). E.g.:
• Mutex locks: utilization → lock hold time; saturation → lock contention; errors → any errors
• Entire application: utilization → percentage of worker threads busy; saturation → length of queued work; errors → request errors
(Diagram axis: Resource Utilization (%).)
slide 27:
RED Method
• For every service, check that: request rate, error rate, and duration (distribution) are within SLO/A
• Another exercise in posing questions from functional diagrams
(Diagram: Load Balancer, Web Proxy, Web Server, User Database, Payments Server, Asset Server, Metrics Database.)
By Tom Wilkie: http://www.slideshare.net/weaveworks/monitoring-microservices
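As an illustration only: the three RED metrics could be approximated from a web server access log with one-liners like these. The log path and field positions are hypothetical and depend on your log format.

  LOG=/var/log/nginx/access.log    # hypothetical path

  # Request rate: requests per second over recent traffic (field 4 = timestamp)
  tail -n 10000 "$LOG" | awk '{print $4}' | uniq -c | tail -5

  # Error rate: fraction of HTTP 5xx responses (field 9 = status, assumed)
  tail -n 10000 "$LOG" | awk '$9 ~ /^5/ {e++} END {print e+0 " errors / " NR " requests"}'

  # Duration: rough p50/p99 of request time (assumed to be the last field)
  tail -n 10000 "$LOG" | awk '{print $NF}' | sort -n | \
      awk '{a[NR]=$1} END {print "p50:", a[int(NR*0.5)], " p99:", a[int(NR*0.99)]}'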
slide 28:
Thread State Analysis
• State transition diagram
• Identify & quantify time in states
• Narrows further analysis to the state
• Thread states are applicable to all apps
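A crude first pass on Linux (my sketch, not from the slides): the scheduler state letters from ps(1) only approximate the TSA states, but can show where thread time is likely going.

  # Count all threads by scheduler state (R running/runnable, S sleeping,
  # D uninterruptible sleep, Z zombie, T stopped)
  ps -eLo state | sort | uniq -c | sort -rn

  # For one process (PID 1234 is hypothetical), include the wait channel
  ps -Lo state,wchan:32,comm -p 1234 | sort | uniq -c | sort -rn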
slide 29:
TSA: e.g., Solaris
slide 30:
TSA: e.g., RSTS/E. RSTS: DEC OS from the 1970s. TENEX (1969–72) also had Control-T for job states.
slide 31:
TSA: e.g., OS X Instruments: Thread States
slide 32:
On-CPU Analysis
1. Split into user/kernel states – /proc, vmstat(1)
2. Check CPU balance – mpstat(1), CPU utilization heat map
3. Profile software – user & kernel stack sampling (as a CPU flame graph)
4. Profile cycles, caches, busses – PMCs, CPI flame graph
slide 33:
CPU Flame Graph Analysis
1. Take a CPU profile
2. Render it as a flame graph
3. Understand all software that is in >1% of samples
Discovers issues by their CPU usage – directly (CPU consumers) or indirectly (initialization of I/O, locks, times, …). Narrows the target of study to only running code. See "The Flame Graph", CACM, June 2016.
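For example, the commonly documented Linux perf workflow; the FlameGraph scripts are from https://github.com/brendangregg/FlameGraph, and the local paths are assumed.

  # 1. Take a CPU profile: sample stacks system-wide at 99 Hertz for 30 seconds
  perf record -F 99 -a -g -- sleep 30

  # 2. Render it as a flame graph
  perf script | ./stackcollapse-perf.pl | ./flamegraph.pl > cpu.svg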
slide 34:
Java Mixed-Mode CPU Flame Graph
• E.g., Linux perf_events, with:
  – Java -XX:+PreserveFramePointer
  – perf-map-agent
(Flame graph regions: Java, GC, JVM, Kernel.)
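A sketch of that recipe, assuming Linux perf, a local perf-map-agent checkout, and a hypothetical application JAR; script names and paths may differ by perf-map-agent version.

  # Run the JVM with frame pointers preserved so perf can walk Java stacks
  java -XX:+PreserveFramePointer -jar myapp.jar &

  # Sample its stacks, then generate /tmp/perf-<pid>.map with perf-map-agent
  perf record -F 99 -p $(pgrep -f myapp.jar) -g -- sleep 30
  ./perf-map-agent/bin/create-java-perf-map.sh $(pgrep -f myapp.jar)

  # Render a mixed-mode (Java + JVM + kernel) flame graph
  perf script | ./stackcollapse-perf.pl | ./flamegraph.pl --color=java > java.svg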
slide 35:
CPI Flame Graph
• Profile cycle stack traces and instructions or stalls separately
• Generate a CPU flame graph (cycles) and color it using the other profile
• E.g., FreeBSD: pmcstat. Red == instructions, blue == stalls.
slide 36:
Off-CPU Analysis
Analyze off-CPU time via blocking code paths: off-CPU flame graph. Often need wakeup code paths as well…
slide 37:
Off-CPU Time Flame Graph
(Example flame graph annotated with: directory read from disk, file read from disk, fstat from disk, path read from disk, pipe write. X axis: off-CPU time; Y axis: stack depth.)
Trace blocking events with kernel stacks & time blocked (e.g., using Linux BPF).
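One way to produce such a graph today (a sketch, assuming bcc is installed at its usual path and the FlameGraph scripts are local; the target process name is hypothetical):

  # Trace blocked (off-CPU) time with kernel and user stacks for 30 seconds
  /usr/share/bcc/tools/offcputime -df -p $(pgrep -x myapp) 30 > out.stacks

  # Render as an off-CPU time flame graph (widths are microseconds blocked)
  ./flamegraph.pl --color=io --countname=us < out.stacks > offcpu.svg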
slide 38:
Wakeup Time Flame Graph
Who did the wakeup: … can also associate wake-up stacks with off-CPU stacks (e.g., Linux 4.6: samples/bpf/offwaketime*)
slide 39:
Chain Graphs
• Associate more than one waker: the full chain of wakeups
• With enough stacks, all paths lead to metal
• An approach for analyzing all off-CPU issues
slide 40:
Latency Correlations
1. Measure latency histograms at different stack layers
2. Compare histograms to find the latency origin
Even better, use latency heat maps, and match outliers based on both latency and time.
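For instance (my example, assuming bcc and an ext4 file system): compare in-kernel latency histograms from two layers of the storage stack; if the distributions match, the latency likely originates at or below the lower layer.

  # File system layer: ext4 operation latency histogram, one 10-second interval
  /usr/share/bcc/tools/ext4dist 10 1

  # Block device layer: disk I/O latency histogram, one 10-second interval
  /usr/share/bcc/tools/biolatency 10 1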
slide 41:
Checklists: e.g., Linux Perf Analysis in 60s
1. uptime (load averages)
2. dmesg | tail (kernel errors)
3. vmstat 1 (overall stats by time)
4. mpstat -P ALL 1 (CPU balance)
5. pidstat 1 (process usage)
6. iostat -xz 1 (disk I/O)
7. free -m (memory usage)
8. sar -n DEV 1 (network I/O)
9. sar -n TCP,ETCP 1 (TCP stats)
10. top (check overview)
http://techblog.netflix.com/2015/11/linux-performance-analysis-in-60s.html
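The checklist combined into one runnable sketch; counts are added so each command terminates, and most tools come from the sysstat package.

  #!/bin/bash
  # Linux performance analysis in ~60 seconds
  uptime                 # load averages
  dmesg | tail           # kernel errors
  vmstat 1 5             # overall stats by time
  mpstat -P ALL 1 5      # CPU balance
  pidstat 1 5            # process usage
  iostat -xz 1 5         # disk I/O
  free -m                # memory usage
  sar -n DEV 1 5         # network I/O
  sar -n TCP,ETCP 1 5    # TCP stats
  top -b -n 1 | head -30 # check overview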
slide 42:
Checklists: e.g., Netflix perfvitals Dashboard
1. RPS, CPU  2. Volume  3. Instances  4. Scaling  5. CPU/RPS  6. Load Avg  7. Java Heap  8. ParNew  9. Latency  10. 99th percentile
slide 43:
Static Performance Tuning: e.g., Linux
slide 44:
Tools-Based Method
1. Try all the tools!
May be an anti-pattern. E.g., OS X:
slide 45:
Other Methodologies
• Scientific method
• 5 Whys
• Process of elimination
• Intel's Top-Down Methodology
• Method R
slide 46:
What You Can Do
slide 47:
What you can do
1. Know what's now possible on modern systems:
  – Dynamic tracing: efficiently instrument any software
  – CPU facilities: PMCs, MSRs (model-specific registers)
  – Visualizations: flame graphs, latency heat maps, …
2. Ask questions first: use methodologies to ask them
3. Then find/build the metrics
4. Build or buy dashboards to support methodologies
slide 48:
Dynamic Tracing: Efficient Metrics
E.g., tracing TCP retransmits.
Old way: packet capture. tcpdump: 1. read, 2. dump to a buffer/file; analyzer: 1. read, 2. process, 3. print. (Involves the kernel send/receive path, the file system, and disks.)
New way: dynamic tracing. Tracer: 1. configure, 2. read – instrumenting tcp_retransmit_skb() directly in the kernel.
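On Linux today this can be done with BPF front ends; for example (assuming bpftrace or bcc is installed; note the process name seen at retransmit time is whatever task is on-CPU, not necessarily the connection owner):

  # Count TCP retransmits in-kernel by tracing tcp_retransmit_skb()
  bpftrace -e 'kprobe:tcp_retransmit_skb { @[comm] = count(); }'

  # Or print each retransmit with addresses using bcc's tcpretrans tool
  /usr/share/bcc/tools/tcpretrans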
slide 49:
Dynamic Tracing: Measure Anything
Those are Solaris/DTrace tools. Now becoming possible on all OSes: FreeBSD & OS X DTrace, Linux BPF, Windows ETW.
slide 50:
Performance Monitoring Counters
E.g., FreeBSD PMC groups for Intel Sandy Bridge.
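On Linux, the rough equivalent of browsing PMC groups is perf(1); a minimal sketch, keeping in mind that available event names vary by CPU:

  # Architectural summary counters, system-wide, for 10 seconds
  perf stat -a -d sleep 10

  # Specific events: cycles, instructions, and last-level cache misses
  perf stat -a -e cycles,instructions,LLC-load-misses sleep 10

  # List the PMC events perf knows about on this system
  perf list | head -40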
slide 51:
Visualizations
E.g., disk I/O latency as a heat map, quantized in kernel.
slide 52:
USE Method: e.g., Netflix Vector
(Dashboard showing utilization, saturation, and load metrics for CPU, Network, Memory, and Disk.)
slide 53:
USE Method: To Do
Showing what is and is not commonly measured: a U/S/E (utilization, saturation, errors) cell for each resource.
slide 54:
CPU Workload Characterization: To Do
Showing what is and is not commonly measured. Who: top, htop. Why: perf record -g, flame graphs. What: perf stat -a -d. How: monitoring.
slide 55:
Summary
• It is the crystal ball age of performance observability
• What matters is the questions you want answered
• Methodologies are a great way to pose questions
slide 56:
References & Resources
• USE Method: http://queue.acm.org/detail.cfm?id=2413037 ; http://www.brendangregg.com/usemethod.html
• TSA Method: http://www.brendangregg.com/tsamethod.html
• Off-CPU Analysis: http://www.brendangregg.com/offcpuanalysis.html ; http://www.brendangregg.com/blog/2016-01-20/ebpf-offcpu-flame-graph.html ; http://www.brendangregg.com/blog/2016-02-05/ebpf-chaingraph-prototype.html
• Static Performance Tuning, Richard Elling, Sun blueprint, May 2000
• RED Method: http://www.slideshare.net/weaveworks/monitoring-microservices
• Other system methodologies: Systems Performance: Enterprise and the Cloud, Prentice Hall, 2013; http://www.brendangregg.com/methodology.html ; The Art of Computer Systems Performance Analysis, Jain, R., 1991
• Flame Graphs: http://queue.acm.org/detail.cfm?id=2927301 ; http://www.brendangregg.com/flamegraphs.html ; http://techblog.netflix.com/2015/07/java-in-flames.html
• Latency Heat Maps: http://queue.acm.org/detail.cfm?id=1809426 ; http://www.brendangregg.com/HeatMaps/latency.html
• ARPA Network: http://www.computerhistory.org/internethistory/1960s
• RSTS/E System User's Guide, 1985, page 4-5
• DTrace: Dynamic Tracing in Oracle Solaris, Mac OS X, and FreeBSD, Prentice Hall, 2011
• Apollo: http://www.hq.nasa.gov/office/pao/History/alsj/a11 ; http://www.hq.nasa.gov/alsj/alsj-LMdocs.html
slide 57:
ACM Applicative 2016, Jun 2016
Questions?
http://slideshare.net/brendangregg
http://www.brendangregg.com
bgregg@netflix.com
@brendangregg