gcc x86-32堆栈对齐并调用printf

Question

据我所知，x86-64要求堆栈在调用之前是16字节对齐，而gcc with -m32 doesn't require this for main。

我有以下测试代码：

.data
intfmt:         .string "int: %d
"
testint:        .int    20

.text
.globl main

main:
    mov     %esp, %ebp
    push    testint
    push    $intfmt
    call    printf
    mov     %ebp, %esp
    ret

用as --32 test.S -o test.o && gcc -m32 test.o -o test构建。我知道syscall写存在，但据我所知它不能打印整数和浮动printf的方式。

进入main后，堆栈上有一个4字节的返回地址。然后天真地解释这个代码，两个push调用各自在堆栈上放置4个字节，因此调用需要另一个4字节值推送对齐。

这是gas和gcc生成的二进制文件的objdump：

0000053d <main>:
 53d:   89 e5                   mov    %esp,%ebp
 53f:   ff 35 1d 20 00 00       pushl  0x201d
 545:   68 14 20 00 00          push   $0x2014
 54a:   e8 fc ff ff ff          call   54b <main+0xe>
 54f:   89 ec                   mov    %ebp,%esp
 551:   c3                      ret    
 552:   66 90                   xchg   %ax,%ax
 554:   66 90                   xchg   %ax,%ax
 556:   66 90                   xchg   %ax,%ax
 558:   66 90                   xchg   %ax,%ax
 55a:   66 90                   xchg   %ax,%ax
 55c:   66 90                   xchg   %ax,%ax
 55e:   66 90                   xchg   %ax,%ax

我对生成的推送指令非常困惑。

如果按下两个4字节值，如何实现对齐？
为什么要推0x2014而不是0x14？什么是0x201d？
call 54b甚至实现了什么？ hd的输出与objdump匹配。为什么这在gdb中有所不同？这是动态链接器吗？

B+>│0x5655553d <main>                       mov    %esp,%ebp                      │
   │0x5655553f <main+2>                     pushl  0x5655701d                     │
   │0x56555545 <main+8>                     push   $0x56557014                    │
   │0x5655554a <main+13>                    call   0xf7e222d0 <printf>            │
   │0x5655554f <main+18>                    mov    %ebp,%esp                      │
   │0x56555551 <main+20>                    ret

关于实际执行二进制文件时所发生的事情的资源是值得赞赏的，因为我不知道实际发生了什么以及我读过的教程没有涵盖它。我正在通过How programs get run: ELF binaries阅读。