diff options
Diffstat (limited to 'www/python-reference.html')
| -rwxr-xr-x | www/python-reference.html | 259 |
1 files changed, 251 insertions, 8 deletions
diff --git a/www/python-reference.html b/www/python-reference.html index d6aa80f87a1b..bde728f9a4c9 100755 --- a/www/python-reference.html +++ b/www/python-reference.html @@ -316,6 +316,195 @@ Enter your Python command(s). Type 'DONE' to end. </div> </div> + </div> + <div class="post"> + <h1 class ="postheader">Using the Python API's to create custom breakpoints</h1> + <div class="postcontent"> + + <p>Another use of the Python API's in lldb is to create a custom breakpoint resolver. This facility + was added in r342259. + </p> + <p> + It allows you to provide the algorithm which will be used in the breakpoint's + search of the space of the code in a given Target + to determine where to set the breakpoint locations - the actual places where the breakpoint will trigger. + To understand how this works you need to know a little about how lldb handles breakpoints. + </p> + <p> + In lldb, a breakpoint is composed of three parts: the Searcher, the Resolver, and the Stop Options. The Searcher and + Resolver cooperate to determine how breakpoint locations are set and differ between each breakpoint type. + Stop options determine what happens when a location triggers and includes the commands, conditions, ignore counts, etc. + Stop options are common between all breakpoint types, so for our purposes only the Searcher and Resolver are relevant. + </p> + <p> + The Searcher's job is to traverse in a structured way the code in the current target. It + proceeds from the Target, to search all the Modules in the Target, in each Module it can recurse + into the Compile Units in that module, and within each Compile Unit it can recurse over the Functions + it contains. + </p> + <p> + The Searcher can be provided with a SearchFilter that it will use to restrict this search. For instance, if the + SearchFilter specifies a list of Modules, the Searcher will not recurse into Modules that aren't on the list. + When you pass the <b>-s modulename</b> flag to <b>break set</b> you are creating a Module-based search filter. + When you pass <b>-f filename.c</b> to <b>break set -n</b> you are creating a file based search filter. If neither + of these is specified, the breakpoint will have a no-op search filter, so all parts of the program are searched + and all locations accepted. + </p> + <p> + The Resolver has two functions. The most important one is the callback it provides. This will get called at the appropriate time + in the course of the search. The callback is where the job of adding locations to the breakpoint gets done. + </p> + <p> + The other function is specifying to the Searcher at what depth in the above described recursion it wants to be + called. Setting a search depth also provides a stop for the recursion. For instance, if you request a Module depth + search, then the callback will be called for each Module as it gets added to the Target, but the searcher will not recurse into the + Compile Units in the module. + </p> + <p> + One other slight sublety is that the depth at which you get called back is not necessarily the depth at which the + the SearchFilter is specified. For instance, if you are doing symbol searches, it is convenient to use the Module + depth for the search, since symbols are stored in the module. + But the SearchFilter might specify some subset of CompileUnits, so not all the symbols you might find in each module + will pass the search. You don't need to + handle this situation yourself, since <b>SBBreakpoint::AddLocation</b> will only add locations that pass the Search Filter. + This API returns an SBError to inform you whether your location was added. + </p> + <p> + When the breakpoint is originally created, its Searcher will process all the currently loaded modules. + The Searcher will also visit any new modules as they are added to the target. This happens, for instance, when + a new shared library gets added to the target in the course of running, or on rerunning if any of the currently + loaded modules have been changed. Note, in the latter case, all the locations set in the old module will get + deleted and you will be asked to recreate them in the new version of the module when your callback gets called + with that module. For this reason, you shouldn't + try to manage the locations you add to the breakpoint yourself. Note that the Breakpoint takes care of + deduplicating equal addresses in AddLocation, so you shouldn't need to worry about that anyway. + </p> + <p> + At present, when adding a scripted Breakpoint type, you can only provide a custom Resolver, not a custom SearchFilter. + </p> + <p> + The custom Resolver is provided as a Python class with the following methods: + </p> +</tt></pre></code> + <p><table class="stats" width="620" cellspacing="0"> + <tr> + <td class="hed" width="10%">Name</td> + <td class="hed" width="10%">Arguments</td> + <td class="hed" width="80%">Description</td> + </tr> + + + <tr> + <td class="content"> + <b>__init__</b> + </td> + <td class="content"> + <b>bkpt: lldb.SBBreakpoint</b> + <b>extra_args: lldb.SBStructuredData</b> + </td> + <td class="content"> + <p> + This is the constructor for the new Resolver. + </p> + <p> + <b>bkpt</b> is the breakpoint owning this Resolver. + </p> + <p> + <b>extra_args</b> is an SBStructuredData object that the user can pass in when creating instances of this + breakpoint. It is not required, but is quite handy. For instance if you were implementing a breakpoint on some + symbol name, you could write a generic symbol name based Resolver, and then allow the user to pass + in the particular symbol in the extra_args + </td> + </tr> + + <tr> + <td class="content"> + <b>__callback__</b> + </td> + <td class="content"> + <b>sym_ctx: lldb.SBSymbolContext</b> + </td> + <td class="content"> + This is the Resolver callback. + The <b>sym_ctx</b> argument will be filled with the current stage + of the search. + </p> + <p> + For instance, if you asked for a search depth of lldb.eSearchDepthCompUnit, then the + target, module and compile_unit fields of the sym_ctx will be filled. The callback should look just in the + context passed in <b>sym_ctx</b> for new locations. If the callback finds an address of interest, it + can add it to the breakpoint with the <b>SBBreakpoint::AddLocation</b> method, using the breakpoint passed + in to the <b>__init__</b> method. + </td> + </tr> + <tr> + <td class="content"> + <b>__get_depth__</b> + </td> + <td class="content"> + <b>None</b> + </td> + <td class="content"> + Specify the depth at which you wish your callback to get called. The currently supported options are: + <dl> + <dt>lldb.eSearchDepthModule</dt> + <dt>lldb.eSearchDepthCompUnit</dt> + <dt>lldb.eSearchDepthFunction</dt> + </dl> + For instance, if you are looking + up symbols, which are stored at the Module level, you will want to get called back module by module. + So you would want to return <b>lldb.eSearchDepthModule</b>. This method is optional. If not provided the search + will be done at Module depth. + </td> + </tr> + <tr> + <td class="content"> + <b>get_short_help</b> + </td> + <td class="content"> + <b>None</b> + </td> + <td class="content"> + This is an optional method. If provided, the returned string will be printed at the beginning of + the description for this breakpoint. + </td> + </tr> + </table> + + <p>To define a new breakpoint command defined by this class from the lldb command line, use the command:</p> + +<code><pre><tt>(lldb) <strong>breakpoint set -P MyModule.MyResolverClass</strong> +</tt></pre></code> + <p>You can also populate the extra_args SBStructuredData with a dictionary of key/value pairs with:</p> + +<code><pre><tt>(lldb) <strong>breakpoint set -P MyModule.MyResolverClass -k key_1 -v value_1 -k key_2 -v value_2</strong> +</tt></pre></code> + <p>Although you can't write a scripted SearchFilter, both the command line and the SB API's for adding a + scripted resolver allow you to specify a SearchFilter restricted to certain modules or certain compile + units. When using the command line to create the resolver, you can specify a Module specific SearchFilter + by passing the <b>-s ModuleName</b> option - which can be specified multiple times. + You can also specify a SearchFilter restricted to certain + compile units by passing in the <b>-f CompUnitName</b> option. This can also be specified more than + once. And you can mix the two to specify "this comp unit in this module". So, for instance, + </p> + +<code><pre><tt>(lldb) <strong>breakpoint set -P MyModule.MyResolverClass -s a.out</strong> +</tt></pre></code> + <p> + will use your resolver, but will only recurse into or accept new locations in the module a.out. + </p> + + <p>Another option for creating scripted breakpoints is to use the <b>SBTarget.CreateBreakpointFromScript</b> API. + This one has the advantage that you can pass in an arbitrary SBStructuredData object, so you can + create more complex parametrizations. + SBStructuredData has a handy SetFromJSON method which you can use for this purpose. + Your __init__ function gets passed this SBStructuredData object. + This API also allows you to directly provide the list of Modules and the list of CompileUnits that will + make up the SearchFilter. If you pass in empty lists, the breakpoint will use the default "search everywhere,accept + everything" filter. + </p> + + </div> <div class="post"> <h1 class ="postheader">Using the Python API's to create custom stepping logic</h1> <div class="postcontent"> @@ -654,13 +843,14 @@ $2 = 0x000000010010aba0 Let's Be Friendsfoobar</tt></pre></code> <b>LLDB.framework</b> to create your own stand-alone python programs, you will need to tell python where to look in order to find this module. This is done by setting the <b>PYTHONPATH</b> environment variable, adding - a path to the directory that contains the <b>lldb.py</b> python module. On - Mac OS X, this is contained inside the LLDB.framework, so you would do: + a path to the directory that contains the <b>lldb.py</b> python module. The + lldb driver program has an option to report the path to the lldb module. + You can use that to point to correct lldb.py: <p>For csh and tcsh:</p> - <p><code>% <b>setenv PYTHONPATH /Developer/Library/PrivateFrameworks/LLDB.framework/Resources/Python</b></code></p> + <p><code>% <b>setenv PYTHONPATH `lldb -P`</b></p> <p>For sh and bash: - <p><code>% <b>export PYTHONPATH=/Developer/Library/PrivateFrameworks/LLDB.framework/Resources/Python</b></code></p> + <p><code>% <b>export PYTHONPATH=`lldb -P`</b></p> <p> Alternately, you can append the LLDB Python directory to the <b>sys.path</b> list directly in your Python code before importing the lldb module.</p> @@ -738,11 +928,64 @@ if target: <font color=green># We do have a symbol, print some info for the symbol</font> print symbol </tt></pre></code> - </div> - <div class="postfooter"></div> + </div> + <div class="postfooter"></div> + </div> + + <div class="post"> + <h1 class ="postheader">Writing LLDB frame recognizers in Python</h1> + <div class="postcontent"> + + <p>Frame recognizers allow for retrieving information about special frames based on + ABI, arguments or other special properties of that frame, even without source + code or debug info. Currently, one use case is to extract function arguments + that would otherwise be unaccesible, or augment existing arguments.</p> + + <p>Adding a custom frame recognizer is done by implementing a Python class + and using the '<b>frame recognizer add</b>' command. The Python class should have a + '<b>get_recognized_arguments</b>' method and it will receive an argument of type + <b>lldb.SBFrame</b> representing the current frame that we are trying to recognize. + The method should return a (possibly empty) list of <b>lldb.SBValue</b> objects that + represent the recognized arguments.</p> + + <p>An example of a recognizer that retrieves the file descriptor values from libc + functions '<b>read</b>', '<b>write</b>' and '<b>close</b>' follows:</p> + +<code><pre><tt> class LibcFdRecognizer(object): + def get_recognized_arguments(self, frame): + if frame.name in ["read", "write", "close"]: + fd = frame.EvaluateExpression("$arg1").unsigned + value = lldb.target.CreateValueFromExpression("fd", "(int)%d" % fd) + return [value] + return [] +</tt></pre></code> + + <p>The file containing this implementation can be imported via '<b>command script + import</b>' and then we can register this recognizer with '<b>frame recognizer add</b>'. + It's important to restrict the recognizer to the libc library (which is + libsystem_kernel.dylib on macOS) to avoid matching functions with the same name in other modules:</p> - </div> - </div> +<code><pre><tt>(lldb) <b>command script import .../fd_recognizer.py</b> +(lldb) <b>frame recognizer add -l fd_recognizer.LibcFdRecognizer -n read -s libsystem_kernel.dylib</b> +</tt></pre></code> + + <p>When the program is stopped at the beginning of the '<b>read</b>' function in libc, we + can view the recognizer arguments in '<b>frame variable</b>':</p> + +<code><pre><tt>(lldb) <b>b read</b> +(lldb) <b>r</b> +Process 1234 stopped +* thread #1, queue = 'com.apple.main-thread', stop reason = breakpoint 1.3 + frame #0: 0x00007fff06013ca0 libsystem_kernel.dylib`read +(lldb) <b>frame variable</b> +(int) fd = 3 +</tt></pre></code> + + </div> + <div class="postfooter"></div> + </div> + + </div> </div> </body> </html> |
