1		README for GPROF
2
3This is the GNU profiler.  It is distributed with other "binary
4utilities" which should be in ../binutils.  See ../binutils/README for
5more general notes, including where to send bug reports.
6
7This file documents the changes and new features available with this
8version of GNU gprof.
9
10* New Features
11
12 o Long options
13
14 o Supports generalized file format, without breaking backward compatibility:
15   new file format supports basic-block execution counts and non-realtime
16   histograms (see below)
17
18 o Supports profiling at the line level: flat profiles, call-graph profiles,
19   and execution-counts can all be displayed at a level that identifies
20   individual lines rather than just functions
21
22 o Test-coverage support (similar to Sun tcov program): source files
23   can be annotated with the number of times a function was invoked
24   or with the number of times each basic-block in a function was
25   executed
26
27 o Generalized histograms: not just execution-time, but arbitrary
28   histograms are support (for example, performance counter based
29   profiles)
30
31 o Powerful mechanism to select data to be included/excluded from
32   analysis and/or output
33
34 o Support for DEC OSF/1 v3.0
35
36 o Full cross-platform profiling support: gprof uses BFD to support
37   arbitrary, non-native object file formats and non-native byte-orders
38   (this feature has not been tested yet)
39
40 o In the call-graph function index, static function names are now
41   printed together with the filename in which the function was defined
42   (required bfd_find_nearest_line() support and symbolic debugging
43    information to be present in the executable file)
44
45 o Major overhaul of source code (compiles cleanly with -Wall, etc.)
46
47* Supported Platforms
48
49The current version is known to work on:
50
51 o DEC OSF/1 v3.0
52	All features supported.
53
54 o SunOS 4.1.x
55	All features supported.
56
57 o Solaris 2.3
58	Line-level profiling unsupported because bfd_find_nearest_line()
59	is not fully implemented for Elf binaries.
60
61 o HP-UX 9.01
62	Line-level profiling unsupported because bfd_find_nearest_line()
63	is not fully implemented for SOM binaries.
64
65* Detailed Description
66
67** User Interface Changes
68
69The command-line interface is backwards compatible with earlier
70versions of GNU gprof and Berkeley gprof.  The only exception is
71the option to delete arcs from the call graph.  The old syntax
72was:
73
74	-k fromname toname
75
76while the new syntax is:
77
78	-k fromname/toname
79
80This change was necessary to be compatible with long-option parsing.
81Also, "fromname" and "toname" can now be arbitrary symspecs rather
82than just function names (see below for an explanation of symspecs).
83For example, option "-k gprof.c/" suppresses all arcs due to calls out
84of file "gprof.c".
85
86*** Sym Specs
87
88It is often necessary to apply gprof only to specific parts of a
89program.  GNU gprof has a simple but powerful mechanism to achieve
90this.  So called {\em symspecs\/} provide the foundation for this
91mechanism.  A symspec selects the parts of a profiled program to which
92an operation should be applied to.  The syntax of a symspec is
93simple:
94
95	  filename_containing_a_dot
96	| funcname_not_containing_a_dot
97	| linenumber
98	| ( [ any_filename ] `:' ( any_funcname | linenumber ) )
99
100Here are some examples:
101
102	main.c			Selects everything in file "main.c"---the
103				dot in the string tells gprof to interpret
104				the string as a filename, rather than as
105				a function name.  To select a file whose
106				name does contain a dot, a trailing colon
107				should be specified.  For example, "odd:" is
108				interpreted as the file named "odd".
109
110	main			Selects all functions named "main".  Notice
111				that there may be multiple instances of the
112				same function name because some of the
113				definitions may be local (i.e., static).
114				Unless a function name is unique in a program,
115				you must use the colon notation explained
116				below to specify a function from a specific
117				source file.  Sometimes, functionnames contain
118				dots.  In such cases, it is necessary to
119				add a leading colon to the name.  For example,
120				":.mul" selects function ".mul".
121
122	main.c:main		Selects function "main" in file "main.c".
123
124	main.c:134		Selects line 134 in file "main.c".
125
126IMPLEMENTATION NOTE: The source code uses the type sym_id for symspecs.
127At some point, this probably ought to be changed to "sym_spec" to make
128reading the code easier.
129
130*** Long options
131
132GNU gprof now supports long options.  The following is a list of all
133supported options.  Options that are listed without description
134operate in the same manner as the corresponding option in older
135versions of gprof.
136
137Short Form:	Long Form:
138-----------	----------
139-l		--line
140			Request profiling at the line-level rather
141			than just at the function level.  Source
142			lines are identified by symbols of the form:
143
144				func (file:line)
145
146			where "func" is the function name, "file" is the
147			file name and "line" is the line-number that
148			corresponds to the line.
149
150			To work properly, the binary must contain symbolic
151			debugging information.  This means that the source
152			have to be translated with option "-g" specified.
153			Functions for which there is no symbolic debugging
154			information available are treated as if "--line"
155			had not been specified.  However, the line number
156			printed with such symbols is usually incorrect
157			and should be ignored.
158
159-a		--no-static
160-A[symspec]	--annotated-source[=symspec]
161			Request output in the form of annotated source
162			files.  If "symspec" is specified, print output only
163			for symbols selected by "symspec".  If the option
164			is specified multiple times, annotated output is
165			generated for the union of all symspecs.
166
167			Examples:
168
169			  -A		Prints annotated source for all
170					source files.
171			  -Agprof.c	Prints annotated source for file
172					gprof.c.
173			  -Afoobar	Prints annotated source for files
174					containing a function named "foobar".
175					The entire file will be printed, but
176					only the function itself will be
177					annotated with profile data.
178
179-J[symspec]	--no-annotated-source[=symspec]
180			Suppress annotated source output.  If specified
181			without argument, annotated output is suppressed
182			completely.  With an argument, annotated output
183			is suppressed only for the symbols selected by
184			"symspec".  If the option is specified multiple
185			times, annotated output is suppressed for the
186			union of all symspecs.  This option has lower
187			precedence than --annotated-source
188
189-p[symspec]	--flat-profile[=symspec]
190			Request output in the form of a flat profile
191			(unless any other output-style option is specified,
192			 this option is turned on by default).  If
193			"symspec" is specified, include only symbols
194			selected by "symspec" in flat profile.  If the
195			option is specified multiple times, the flat
196			profile includes symbols selected by the union
197			of all symspecs.
198
199-P[symspec]	--no-flat-profile[=symspec]
200			Suppress output in the flat profile.  If given
201			without an argument, the flat profile is suppressed
202			completely.  If "symspec" is specified, suppress
203			the selected symbols in the flat profile.  If the
204			option is specified multiple times, the union of
205			the selected symbols is suppressed.  This option
206			has lower precedence than --flat-profile.
207
208-q[symspec]	--graph[=symspec]
209			Request output in the form of a call-graph
210			(unless any other output-style option is specified,
211			 this option is turned on by default).  If "symspec"
212			is specified, include only symbols selected by
213			"symspec" in the call-graph.  If the option is
214			specified multiple times, the call-graph includes
215			symbols selected by the union of all symspecs.
216
217-Q[symspec]	--no-graph[=symspec]
218			Suppress output in the call-graph.  If given without
219			an argument, the call-graph is suppressed completely.
220			With a "symspec", suppress the selected symbols
221			from the call-graph.  If the option is specified
222			multiple times, the union of the selected symbols
223			is suppressed.  This option has lower precedence
224			than --graph.
225
226-C[symspec]	--exec-counts[=symspec]
227			Request output in the form of execution counts.
228			If "symspec" is present, include only symbols
229			selected by "symspec" in the execution count
230			listing.  If the option is specified multiple
231			times, the execution count listing includes
232			symbols selected by the union of all symspecs.
233
234-Z[symspec]	--no-exec-counts[=symspec]
235			Suppress output in the execution count listing.
236			If given without an argument, the listing is
237			suppressed completely.  With a "symspec", suppress
238			the selected symbols from the call-graph.  If the
239			option is specified multiple times, the union of
240			the selected symbols is suppressed.  This option
241			has lower precedence than --exec-counts.
242
243-i		--file-info
244			Print information about the profile files that
245			are read.  The information consists of the
246			number and types of records present in the
247			profile file.  Currently, a profile file can
248			contain any number and any combination of histogram,
249			call-graph, or basic-block count records.
250
251-s		--sum
252
253-x		--all-lines
254			This option affects annotated source output only.
255			By default, only the lines at the beginning of
256			a basic-block are annotated.  If this option is
257			specified, every line in a basic-block is annotated
258			by repeating the annotation for the first line.
259			This option is identical to tcov's "-a".
260
261-I dirs		--directory-path=dirs
262			This option affects annotated source output only.
263			Specifies the list of directories to be searched
264			for source files.  The argument "dirs" is a colon
265			separated list of directories.  By default, gprof
266			searches for source files relative to the current
267			working directory only.
268
269-z		--display-unused-functions
270
271-m num		--min-count=num
272			This option affects annotated source and execution
273			count output only.  Symbols that are executed
274			less than "num" times are suppressed.  For annotated
275			source output, suppressed symbols are marked
276			by five hash-marks (#####).  In an execution count
277			output, suppressed symbols do not appear at all.
278
279-L		--print-path
280			Normally, source filenames are printed with the path
281			component suppressed.  With this option, gprof
282			can be forced to print the full pathname of
283			source filenames.  The full pathname is determined
284			from symbolic debugging information in the image file
285			and is relative to the directory in which the compiler
286			was invoked.
287
288-y		--separate-files
289			This option affects annotated source output only.
290			Normally, gprof prints annotated source files
291			to standard-output.  If this option is specified,
292			annotated source for a file named "path/filename"
293			is generated in the file "filename-ann".  That is,
294			annotated output is {\em always\/} generated in
295			gprof's current working directory.  Care has to
296			be taken if a program consists of files that have
297			identical filenames, but distinct paths.
298
299-c		--static-call-graph
300
301-t num		--table-length=num
302			This option affects annotated source output only.
303			After annotating a source file, gprof generates
304			an execution count summary consisting of a table
305			of lines with the top execution counts.  By
306			default, this table is ten entries long.
307			This option can be used to change the table length
308			or, by specifying an argument value of 0, it can be
309			suppressed completely.
310
311-n symspec	--time=symspec
312			Only symbols selected by "symspec" are considered
313			in total and percentage time computations.
314			However, this option does not affect percentage time
315			computation for the flat profile.
316			If the option is specified multiple times, the union
317			of all selected symbols is used in time computations.
318
319-N		--no-time=symspec
320			Exclude the symbols selected by "symspec" from
321			total and percentage time computations.
322			However, this option does not affect percentage time
323			computation for the flat profile.
324			This option is ignored if any --time options are
325			specified.
326
327-w num		--width=num
328			Sets the output line width.  Currently, this option
329			affects the printing of the call-graph function index
330			only.
331
332-e		<no long form---for backwards compatibility only>
333-E		<no long form---for backwards compatibility only>
334-f		<no long form---for backwards compatibility only>
335-F		<no long form---for backwards compatibility only>
336-k		<no long form---for backwards compatibility only>
337-b		--brief
338-dnum		--debug[=num]
339
340-h		--help
341			Prints a usage message.
342
343-O name		--file-format=name
344			Selects the format of the profile data files.
345			Recognized formats are "auto", "bsd", "magic",
346			and "prof".  The last one is not yet supported.
347			Format "auto" attempts to detect the file format
348			automatically (this is the default behavior).
349			It attempts to read the profile data files as
350			"magic" files and if this fails, falls back to
351			the "bsd" format.  "bsd" forces gprof to read
352			the data files in the BSD format.  "magic" forces
353			gprof to read the data files in the "magic" format.
354
355-T		--traditional
356-v		--version
357
358** File Format Changes
359
360The old BSD-derived format used for profile data does not contain a
361magic cookie that allows to check whether a data file really is a
362gprof file.  Furthermore, it does not provide a version number, thus
363rendering changes to the file format almost impossible.  GNU gprof
364uses a new file format that provides these features.  For backward
365compatibility, GNU gprof continues to support the old BSD-derived
366format, but not all features are supported with it.  For example,
367basic-block execution counts cannot be accommodated by the old file
368format.
369
370The new file format is defined in header file \file{gmon_out.h}.  It
371consists of a header containing the magic cookie and a version number,
372as well as some spare bytes available for future extensions.  All data
373in a profile data file is in the native format of the host on which
374the profile was collected.  GNU gprof adapts automatically to the
375byte-order in use.
376
377In the new file format, the header is followed by a sequence of
378records.  Currently, there are three different record types: histogram
379records, call-graph arc records, and basic-block execution count
380records.  Each file can contain any number of each record type.  When
381reading a file, GNU gprof will ensure records of the same type are
382compatible with each other and compute the union of all records.  For
383example, for basic-block execution counts, the union is simply the sum
384of all execution counts for each basic-block.
385
386*** Histogram Records
387
388Histogram records consist of a header that is followed by an array of
389bins.  The header contains the text-segment range that the histogram
390spans, the size of the histogram in bytes (unlike in the old BSD
391format, this does not include the size of the header), the rate of the
392profiling clock, and the physical dimension that the bin counts
393represent after being scaled by the profiling clock rate.  The
394physical dimension is specified in two parts: a long name of up to 15
395characters and a single character abbreviation.  For example, a
396histogram representing real-time would specify the long name as
397"seconds" and the abbreviation as "s".  This feature is useful for
398architectures that support performance monitor hardware (which,
399fortunately, is becoming increasingly common).  For example, under DEC
400OSF/1, the "uprofile" command can be used to produce a histogram of,
401say, instruction cache misses.  In this case, the dimension in the
402histogram header could be set to "i-cache misses" and the abbreviation
403could be set to "1" (because it is simply a count, not a physical
404dimension).  Also, the profiling rate would have to be set to 1 in
405this case.
406
407Histogram bins are 16-bit numbers and each bin represent an equal
408amount of text-space.  For example, if the text-segment is one
409thousand bytes long and if there are ten bins in the histogram, each
410bin represents one hundred bytes.
411
412
413*** Call-Graph Records
414
415Call-graph records have a format that is identical to the one used in
416the BSD-derived file format.  It consists of an arc in the call graph
417and a count indicating the number of times the arc was traversed
418during program execution.  Arcs are specified by a pair of addresses:
419the first must be within caller's function and the second must be
420within the callee's function.  When performing profiling at the
421function level, these addresses can point anywhere within the
422respective function.  However, when profiling at the line-level, it is
423better if the addresses are as close to the call-site/entry-point as
424possible.  This will ensure that the line-level call-graph is able to
425identify exactly which line of source code performed calls to a
426function.
427
428*** Basic-Block Execution Count Records
429
430Basic-block execution count records consist of a header followed by a
431sequence of address/count pairs.  The header simply specifies the
432length of the sequence.  In an address/count pair, the address
433identifies a basic-block and the count specifies the number of times
434that basic-block was executed.  Any address within the basic-address can
435be used.
436
437IMPLEMENTATION NOTE: gcc -a can be used to instrument a program to
438record basic-block execution counts.  However, the __bb_exit_func()
439that is currently present in libgcc2.c does not generate a gmon.out
440file in a suitable format.  This should be fixed for future releases
441of gcc.  In the meantime, contact davidm@cs.arizona.edu for a version
442of __bb_exit_func() to is appropriate.
443
444Copyright (C) 2012-2014 Free Software Foundation, Inc.
445
446Copying and distribution of this file, with or without modification,
447are permitted in any medium without royalty provided the copyright
448notice and this notice are preserved.
449