内核中dump_stack的实现原理 —— 内核函数printk的实现

Posted pengdonglin137

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了内核中dump_stack的实现原理 —— 内核函数printk的实现相关的知识,希望对你有一定的参考价值。

在内核中使用dump_stack的时候可以看到如下用法:
static inline void print_ip_sym(unsigned long ip)

    printk("[<%px>] %pS\\n", (void *) ip, (void *) ip);
 
然后我们就可以看到类似如下的内核log:
技术图片
 
可以看到,上面不光输出了运行地址,还把对应的函数名以及偏移地址都列了出来,内核是如何做到的呢?
在调用printk的时候:
 
printk --> vprintk_default --> vprintk_emit --> vprintk_store --> vscnprintf --> vsnprintf --> pointer
 
上面的转换就是在pointer中完成的:
  1 /*
  2  * Show a ‘%p‘ thing.  A kernel extension is that the ‘%p‘ is followed
  3  * by an extra set of alphanumeric characters that are extended format
  4  * specifiers.
  5  *
  6  * Please update scripts/checkpatch.pl when adding/removing conversion
  7  * characters.  (Search for "check for vsprintf extension").
  8  *
  9  * Right now we handle:
 10  *
 11  * - ‘F‘ For symbolic function descriptor pointers with offset
 12  * - ‘f‘ For simple symbolic function names without offset
 13  * - ‘S‘ For symbolic direct pointers with offset
 14  * - ‘s‘ For symbolic direct pointers without offset
 15  * - ‘[FfSs]R‘ as above with __builtin_extract_return_addr() translation
 16  * - ‘B‘ For backtraced symbolic direct pointers with offset
 17  * - ‘R‘ For decoded struct resource, e.g., [mem 0x0-0x1f 64bit pref]
 18  * - ‘r‘ For raw struct resource, e.g., [mem 0x0-0x1f flags 0x201]
 19  * - ‘b[l]‘ For a bitmap, the number of bits is determined by the field
 20  *       width which must be explicitly specified either as part of the
 21  *       format string ‘%32b[l]‘ or through ‘%*b[l]‘, [l] selects
 22  *       range-list format instead of hex format
 23  * - ‘M‘ For a 6-byte MAC address, it prints the address in the
 24  *       usual colon-separated hex notation
 25  * - ‘m‘ For a 6-byte MAC address, it prints the hex address without colons
 26  * - ‘MF‘ For a 6-byte MAC FDDI address, it prints the address
 27  *       with a dash-separated hex notation
 28  * - ‘[mM]R‘ For a 6-byte MAC address, Reverse order (Bluetooth)
 29  * - ‘I‘ [46] for IPv4/IPv6 addresses printed in the usual way
 30  *       IPv4 uses dot-separated decimal without leading 0‘s (1.2.3.4)
 31  *       IPv6 uses colon separated network-order 16 bit hex with leading 0‘s
 32  *       [S][pfs]
 33  *       Generic IPv4/IPv6 address (struct sockaddr *) that falls back to
 34  *       [4] or [6] and is able to print port [p], flowinfo [f], scope [s]
 35  * - ‘i‘ [46] for ‘raw‘ IPv4/IPv6 addresses
 36  *       IPv6 omits the colons (01020304...0f)
 37  *       IPv4 uses dot-separated decimal with leading 0‘s (010.123.045.006)
 38  *       [S][pfs]
 39  *       Generic IPv4/IPv6 address (struct sockaddr *) that falls back to
 40  *       [4] or [6] and is able to print port [p], flowinfo [f], scope [s]
 41  * - ‘[Ii][4S][hnbl]‘ IPv4 addresses in host, network, big or little endian order
 42  * - ‘I[6S]c‘ for IPv6 addresses printed as specified by
 43  *       http://tools.ietf.org/html/rfc5952
 44  * - ‘E[achnops]‘ For an escaped buffer, where rules are defined by combination
 45  *                of the following flags (see string_escape_mem() for the
 46  *                details):
 47  *                  a - ESCAPE_ANY
 48  *                  c - ESCAPE_SPECIAL
 49  *                  h - ESCAPE_HEX
 50  *                  n - ESCAPE_NULL
 51  *                  o - ESCAPE_OCTAL
 52  *                  p - ESCAPE_NP
 53  *                  s - ESCAPE_SPACE
 54  *                By default ESCAPE_ANY_NP is used.
 55  * - ‘U‘ For a 16 byte UUID/GUID, it prints the UUID/GUID in the form
 56  *       "xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"
 57  *       Options for %pU are:
 58  *         b big endian lower case hex (default)
 59  *         B big endian UPPER case hex
 60  *         l little endian lower case hex
 61  *         L little endian UPPER case hex
 62  *           big endian output byte order is:
 63  *             [0][1][2][3]-[4][5]-[6][7]-[8][9]-[10][11][12][13][14][15]
 64  *           little endian output byte order is:
 65  *             [3][2][1][0]-[5][4]-[7][6]-[8][9]-[10][11][12][13][14][15]
 66  * - ‘V‘ For a struct va_format which contains a format string * and va_list *,
 67  *       call vsnprintf(->format, *->va_list).
 68  *       Implements a "recursive vsnprintf".
 69  *       Do not use this feature without some mechanism to verify the
 70  *       correctness of the format string and va_list arguments.
 71  * - ‘K‘ For a kernel pointer that should be hidden from unprivileged users
 72  * - ‘NF‘ For a netdev_features_t
 73  * - ‘h[CDN]‘ For a variable-length buffer, it prints it as a hex string with
 74  *            a certain separator (‘ ‘ by default):
 75  *              C colon
 76  *              D dash
 77  *              N no separator
 78  *            The maximum supported length is 64 bytes of the input. Consider
 79  *            to use print_hex_dump() for the larger input.
 80  * - ‘a[pd]‘ For address types [p] phys_addr_t, [d] dma_addr_t and derivatives
 81  *           (default assumed to be phys_addr_t, passed by reference)
 82  * - ‘d[234]‘ For a dentry name (optionally 2-4 last components)
 83  * - ‘D[234]‘ Same as ‘d‘ but for a struct file
 84  * - ‘g‘ For block_device name (gendisk + partition number)
 85  * - ‘C‘ For a clock, it prints the name (Common Clock Framework) or address
 86  *       (legacy clock framework) of the clock
 87  * - ‘Cn‘ For a clock, it prints the name (Common Clock Framework) or address
 88  *        (legacy clock framework) of the clock
 89  * - ‘Cr‘ For a clock, it prints the current rate of the clock
 90  * - ‘G‘ For flags to be printed as a collection of symbolic strings that would
 91  *       construct the specific value. Supported flags given by option:
 92  *       p page flags (see struct page) given as pointer to unsigned long
 93  *       g gfp flags (GFP_* and __GFP_*) given as pointer to gfp_t
 94  *       v vma flags (VM_*) given as pointer to unsigned long
 95  * - ‘O‘ For a kobject based struct. Must be one of the following:
 96  *       - ‘OF[fnpPcCF]‘  For a device tree object
 97  *                        Without any optional arguments prints the full_name
 98  *                        f device node full_name
 99  *                        n device node name
100  *                        p device node phandle
101  *                        P device node path spec (name + @unit)
102  *                        F device node flags
103  *                        c major compatible string
104  *                        C full compatible string
105  *
106  * ** Please update also Documentation/printk-formats.txt when making changes **
107  *
108  * Note: The difference between ‘S‘ and ‘F‘ is that on ia64 and ppc64
109  * function pointers are really function descriptors, which contain a
110  * pointer to the real address.
111  */
112 static noinline_for_stack
113 char *pointer(const char *fmt, char *buf, char *end, void *ptr,
114           struct printf_spec spec)
115 
116     const int default_width = 2 * sizeof(void *);
117 
118     if (!ptr && *fmt != K) 
119         /*
120          * Print (null) with the same width as a pointer so it makes
121          * tabular output look nice.
122          */
123         if (spec.field_width == -1)
124             spec.field_width = default_width;
125         return string(buf, end, "(null)", spec);
126     
127 
128     switch (*fmt) 
129     case F:
130     case f:
131         ptr = dereference_function_descriptor(ptr);
132         /* Fallthrough */
133     case S:
134     case s:
135     case B:
136         return symbol_string(buf, end, ptr, spec, fmt);
137     case R:
138     case r:
139         return resource_string(buf, end, ptr, spec, fmt);
140     case h:
141         return hex_string(buf, end, ptr, spec, fmt);
142     case b:
143         switch (fmt[1]) 
144         case l:
145             return bitmap_list_string(buf, end, ptr, spec, fmt);
146         default:
147             return bitmap_string(buf, end, ptr, spec, fmt);
148         
149     case M:            /* Colon separated: 00:01:02:03:04:05 */
150     case m:            /* Contiguous: 000102030405 */
151                     /* [mM]F (FDDI) */
152                     /* [mM]R (Reverse order; Bluetooth) */
153         return mac_address_string(buf, end, ptr, spec, fmt);
154     case I:            /* Formatted IP supported
155                      * 4:    1.2.3.4
156                      * 6:    0001:0203:...:0708
157                      * 6c:    1::708 or 1::1.2.3.4
158                      */
159     case i:            /* Contiguous:
160                      * 4:    001.002.003.004
161                      * 6:   000102...0f
162                      */
163         switch (fmt[1]) 
164         case 6:
165             return ip6_addr_string(buf, end, ptr, spec, fmt);
166         case 4:
167             return ip4_addr_string(buf, end, ptr, spec, fmt);
168         case S: 
169             const union 
170                 struct sockaddr        raw;
171                 struct sockaddr_in    v4;
172                 struct sockaddr_in6    v6;
173              *sa = ptr;
174 
175             switch (sa->raw.sa_family) 
176             case AF_INET:
177                 return ip4_addr_string_sa(buf, end, &sa->v4, spec, fmt);
178             case AF_INET6:
179                 return ip6_addr_string_sa(buf, end, &sa->v6, spec, fmt);
180             default:
181                 return string(buf, end, "(invalid address)", spec);
182             
183         
184         break;
185     case E:
186         return escaped_string(buf, end, ptr, spec, fmt);
187     case U:
188         return uuid_string(buf, end, ptr, spec, fmt);
189     case V:
190         
191             va_list va;
192 
193             va_copy(va, *((struct va_format *)ptr)->va);
194             buf += vsnprintf(buf, end > buf ? end - buf : 0,
195                      ((struct va_format *)ptr)->fmt, va);
196             va_end(va);
197             return buf;
198         
199     case K:
200         switch (kptr_restrict) 
201         case 0:
202             /* Always print %pK values */
203             break;
204         case 1: 
205             const struct cred *cred;
206 
207             /*
208              * kptr_restrict==1 cannot be used in IRQ context
209              * because its test for CAP_SYSLOG would be meaningless.
210              */
211             if (in_irq() || in_serving_softirq() || in_nmi()) 
212                 if (spec.field_width == -1)
213                     spec.field_width = default_width;
214                 return string(buf, end, "pK-error", spec);
215             
216 
217             /*
218              * Only print the real pointer value if the current
219              * process has CAP_SYSLOG and is running with the
220              * same credentials it started with. This is because
221              * access to files is checked at open() time, but %pK
222              * checks permission at read() time. We don‘t want to
223              * leak pointer values if a binary opens a file using
224              * %pK and then elevates privileges before reading it.
225              */
226             cred = current_cred();
227             if (!has_capability_noaudit(current, CAP_SYSLOG) ||
228                 !uid_eq(cred->euid, cred->uid) ||
229                 !gid_eq(cred->egid, cred->gid))
230                 ptr = NULL;
231             break;
232         
233         case 2:
234         default:
235             /* Always print 0‘s for %pK */
236             ptr = NULL;
237             break;
238         
239         break;
240 
241     case N:
242         return netdev_bits(buf, end, ptr, fmt);
243     case a:
244         return address_val(buf, end, ptr, fmt);
245     case d:
246         return dentry_name(buf, end, ptr, spec, fmt);
247     case C:
248         return clock(buf, end, ptr, spec, fmt);
249     case D:
250         return dentry_name(buf, end,
251                    ((const struct file *)ptr)->f_path.dentry,
252                    spec, fmt);
253 #ifdef CONFIG_BLOCK
254     case g:
255         return bdev_name(buf, end, ptr, spec, fmt);
256 #endif
257 
258     case G:
259         return flags_string(buf, end, ptr, fmt);
260     case O:
261         switch (fmt[1]) 
262         case F:
263             return device_node_string(buf, end, ptr, spec, fmt + 1);
264         
265     
266     spec.flags |= SMALL;
267     if (spec.field_width == -1) 
268         spec.field_width = default_width;
269         spec.flags |= ZEROPAD;
270     
271     spec.base = 16;
272 
273     return number(buf, end, (unsigned long) ptr, spec);
274 
第136行, symbol_string(buf, end, ptr, spec, fmt);
 1 static noinline_for_stack
 2 char *symbol_string(char *buf, char *end, void *ptr,
 3             struct printf_spec spec, const char *fmt)
 4 
 5     unsigned long value;
 6 #ifdef CONFIG_KALLSYMS
 7     char sym[KSYM_SYMBOL_LEN];
 8 #endif
 9 
10     if (fmt[1] == R)
11         ptr = __builtin_extract_return_addr(ptr);
12     value = (unsigned long)ptr;
13 
14 #ifdef CONFIG_KALLSYMS
15     if (*fmt == B)
16         sprint_backtrace(sym, value);
17     else if (*fmt != f && *fmt != s)
18         sprint_symbol(sym, value);
19     else
20         sprint_symbol_no_offset(sym, value);
21 
22     return string(buf, end, sym, spec);
23 #else
24     return special_hex_number(buf, end, value, sizeof(void *));
25 #endif
26 
第20行:sprint_symbol_no_offset
 1 /**
 2  * sprint_symbol_no_offset - Look up a kernel symbol and return it in a text buffer
 3  * @buffer: buffer to be stored
 4  * @address: address to lookup
 5  *
 6  * This function looks up a kernel symbol with @address and stores its name
 7  * and module name to @buffer if possible. If no symbol was found, just saves
 8  * its @address as is.
 9  *
10  * This function returns the number of bytes stored in @buffer.
11  */
12 int sprint_symbol_no_offset(char *buffer, unsigned long address)
13 
14     return __sprint_symbol(buffer, address, 0, 0);
15 

 

完。

以上是关于内核中dump_stack的实现原理 —— 内核函数printk的实现的主要内容,如果未能解决你的问题,请参考以下文章

linux内核调试技巧之一 dump_stack

Android系统源码或内核中打印函数调用堆栈(打印堆栈)

Android系统源码或内核中打印函数调用堆栈(打印堆栈)

Android系统源码或内核中打印函数调用堆栈(打印堆栈)

在linux代码中打印函数调用的堆栈的方法

驱动内核定时器的使用