ObjC中的[NSString containsString:]这个函数使用了啥算法?

Posted

技术标签:

【中文标题】ObjC中的[NSString containsString:]这个函数使用了啥算法?【英文标题】:What algorithm is used in [NSString containString:] this function in ObjC?ObjC中的[NSString containsString:]这个函数使用了什么算法? 【发布时间】:2016-07-28 12:14:51 【问题描述】:

我想做一个带地址簿的应用,需要添加搜索功能。 我希望搜索功能可以支持字符串匹配,我在NSString类中找到了[NSString containString:]这个功能。 为了高效,我想用一个很好的字符串匹配算法来实现它,如KMP。所以我想知道这个函数使用了什么算法。还有在哪里可以查看NSString函数的源码?

谢谢。

【问题讨论】:

【参考方案1】:

程序集如下:

Foundation`-[NSString containsString:]:
->  0x10a128954 <+0>:  pushq  %rbp
    0x10a128955 <+1>:  movq   %rsp, %rbp
    0x10a128958 <+4>:  pushq  %r15
    0x10a12895a <+6>:  pushq  %r14
    0x10a12895c <+8>:  pushq  %rbx
    0x10a12895d <+9>:  pushq  %rax
    0x10a12895e <+10>: movq   %rdx, %r14
    0x10a128961 <+13>: movq   %rdi, %rbx
    0x10a128964 <+16>: movq   0x1836d5(%rip), %rsi      ; "length"
    0x10a12896b <+23>: movq   0x19ddee(%rip), %r15      ; (void *)0x000000010a4bb800: objc_msgSend
    0x10a128972 <+30>: callq  *%r15
    0x10a128975 <+33>: movq   0x185dc4(%rip), %rsi      ; "rangeOfString:options:range:locale:"
    0x10a12897c <+40>: movq   $0x0, (%rsp)
    0x10a128984 <+48>: xorl   %ecx, %ecx
    0x10a128986 <+50>: xorl   %r8d, %r8d
    0x10a128989 <+53>: movq   %rbx, %rdi
    0x10a12898c <+56>: movq   %r14, %rdx
    0x10a12898f <+59>: movq   %rax, %r9
    0x10a128992 <+62>: callq  *%r15
    0x10a128995 <+65>: movabsq $0x7fffffffffffffff, %rcx ; imm = 0x7FFFFFFFFFFFFFFF 
    0x10a12899f <+75>: cmpq   %rcx, %rax
    0x10a1289a2 <+78>: setne  %al
    0x10a1289a5 <+81>: addq   $0x8, %rsp
    0x10a1289a9 <+85>: popq   %rbx
    0x10a1289aa <+86>: popq   %r14
    0x10a1289ac <+88>: popq   %r15
    0x10a1289ae <+90>: popq   %rbp
    0x10a1289af <+91>: retq   

本质上是以下算法:

- (bool)containsString:(NSString *)stringToFind 

    if (stringToFind && stringToFind.length < self) 
        return (typeof(NSNotFound))[self rangeOfString:stringToFind options:NSLiteralSearch range:NSMakeRange(0, self.length) locale:nil].location != NSNotFound;
    
    return false;

附:它不检查字符串是否为 nil 或检查其长度。确实如此:

- (bool)containsString:(NSString *)stringToFind 

    return (typeof(NSNotFound))[self rangeOfString:stringToFind options:NSLiteralSearch range:NSMakeRange(0, self.length) locale:nil].location != NSNotFound;

没有安全检查。我不知道为什么我决定添加检查,但是 w/e..

字符串的范围有以下程序集:

Foundation`-[NSString rangeOfString:options:range:locale:]:
->  0x10a031cfd <+0>:   pushq  %rbp
    0x10a031cfe <+1>:   movq   %rsp, %rbp
    0x10a031d01 <+4>:   pushq  %r15
    0x10a031d03 <+6>:   pushq  %r14
    0x10a031d05 <+8>:   pushq  %r13
    0x10a031d07 <+10>:  pushq  %r12
    0x10a031d09 <+12>:  pushq  %rbx
    0x10a031d0a <+13>:  subq   $0x48, %rsp
    0x10a031d0e <+17>:  movq   %r9, %r15
    0x10a031d11 <+20>:  movq   %r8, %r14
    0x10a031d14 <+23>:  movq   %rcx, -0x40(%rbp)
    0x10a031d18 <+27>:  movq   %rdx, %r13
    0x10a031d1b <+30>:  movq   %rsi, -0x48(%rbp)
    0x10a031d1f <+34>:  movq   %rdi, %r12
    0x10a031d22 <+37>:  movq   0x27a317(%rip), %rsi      ; "length"
    0x10a031d29 <+44>:  movq   0x294a30(%rip), %rbx      ; (void *)0x000000010a4bb800: objc_msgSend
    0x10a031d30 <+51>:  movq   %r13, %rdi
    0x10a031d33 <+54>:  callq  *%rbx
    0x10a031d35 <+56>:  movq   %rax, -0x50(%rbp)
    0x10a031d39 <+60>:  movq   0x27a300(%rip), %rsi      ; "length"
    0x10a031d40 <+67>:  movq   %r12, %rdi
    0x10a031d43 <+70>:  callq  *%rbx
    0x10a031d45 <+72>:  movq   %rax, %rbx
    0x10a031d48 <+75>:  subq   %r15, %rax
    0x10a031d4b <+78>:  jb     0x10a031d56               ; <+89>
    0x10a031d4d <+80>:  cmpq   %r14, %rax
    0x10a031d50 <+83>:  jae    0x10a031e07               ; <+266>
    0x10a031d56 <+89>:  callq  0x10a239912               ; symbol stub for: __CFStringNoteErrors
    0x10a031d5b <+94>:  testb  %al, %al
    0x10a031d5d <+96>:  je     0x10a031e07               ; <+266>
    0x10a031d63 <+102>: movl   $0x6, %edi
    0x10a031d68 <+107>: callq  0x10a2396b4               ; symbol stub for: _CFExecutableLinkedOnOrAfter
    0x10a031d6d <+112>: testb  %al, %al
    0x10a031d6f <+114>: je     0x10a031dc5               ; <+200>
    0x10a031d71 <+116>: movq   0x281420(%rip), %rax      ; (void *)0x000000010ac5a358: NSException
    0x10a031d78 <+123>: movq   %rax, -0x58(%rbp)
    0x10a031d7c <+127>: movq   0x294325(%rip), %rax      ; (void *)0x000000010ac74b38: NSRangeException
    0x10a031d83 <+134>: movq   (%rax), %rax
    0x10a031d86 <+137>: movq   %rax, -0x60(%rbp)
    0x10a031d8a <+141>: movq   %r12, %rdi
    0x10a031d8d <+144>: movq   -0x48(%rbp), %rsi
    0x10a031d91 <+148>: callq  0x10a11f7e0               ; _NSMethodExceptionProem
    0x10a031d96 <+153>: movq   %rax, %r8
    0x10a031d99 <+156>: movq   0x27a288(%rip), %rsi      ; "raise:format:"
    0x10a031da0 <+163>: movq   %rbx, 0x8(%rsp)
    0x10a031da5 <+168>: movq   %r15, (%rsp)
    0x10a031da9 <+172>: leaq   0x2a4e40(%rip), %rcx      ; @"%@: Range %lu, %lu out of bounds; string length %lu"
    0x10a031db0 <+179>: xorl   %eax, %eax
    0x10a031db2 <+181>: movq   -0x58(%rbp), %rdi
    0x10a031db6 <+185>: movq   -0x60(%rbp), %rdx
    0x10a031dba <+189>: movq   %r14, %r9
    0x10a031dbd <+192>: callq  *0x29499d(%rip)           ; (void *)0x000000010a4bb800: objc_msgSend
    0x10a031dc3 <+198>: jmp    0x10a031e07               ; <+266>
    0x10a031dc5 <+200>: movb   0x29326d(%rip), %al       ; rangeOfString:options:range:locale:.warnonce
    0x10a031dcb <+206>: testb  %al, %al
    0x10a031dcd <+208>: jne    0x10a031e07               ; <+266>
    0x10a031dcf <+210>: movb   $0x1, 0x293262(%rip)      ; compare:options:range:locale:.localeClass + 7
    0x10a031dd6 <+217>: movq   %r12, %rdi
    0x10a031dd9 <+220>: movq   -0x48(%rbp), %rsi
    0x10a031ddd <+224>: callq  0x10a11f7e0               ; _NSMethodExceptionProem
    0x10a031de2 <+229>: movq   %rax, %rbx
    0x10a031de5 <+232>: movq   %r14, %rdi
    0x10a031de8 <+235>: movq   %r15, %rsi
    0x10a031deb <+238>: callq  0x10a1212f0               ; NSStringFromRange
    0x10a031df0 <+243>: movq   %rax, %rcx
    0x10a031df3 <+246>: leaq   0x2a4e36(%rip), %rdi      ; @"*** %@: Invalid range %@; this will become an exception for apps linked on SnowLeopard. Warning shown once per app execution."
    0x10a031dfa <+253>: xorl   %eax, %eax
    0x10a031dfc <+255>: movq   %rbx, %rsi
    0x10a031dff <+258>: movq   %rcx, %rdx
    0x10a031e02 <+261>: callq  0x10a06c78a               ; NSLog
    0x10a031e07 <+266>: testq  %r13, %r13
    0x10a031e0a <+269>: jne    0x10a031e5e               ; <+353>
    0x10a031e0c <+271>: callq  0x10a239912               ; symbol stub for: __CFStringNoteErrors
    0x10a031e11 <+276>: testb  %al, %al
    0x10a031e13 <+278>: je     0x10a031e5e               ; <+353>
    0x10a031e15 <+280>: movq   0x28137c(%rip), %rax      ; (void *)0x000000010ac5a358: NSException
    0x10a031e1c <+287>: movq   %rax, -0x58(%rbp)
    0x10a031e20 <+291>: movq   0x294241(%rip), %rax      ; (void *)0x000000010ac74b40: NSInvalidArgumentException
    0x10a031e27 <+298>: movq   (%rax), %rax
    0x10a031e2a <+301>: movq   %rax, -0x60(%rbp)
    0x10a031e2e <+305>: movq   %r12, %rdi
    0x10a031e31 <+308>: movq   -0x48(%rbp), %rsi
    0x10a031e35 <+312>: callq  0x10a11f7e0               ; _NSMethodExceptionProem
    0x10a031e3a <+317>: movq   %rax, %rbx
    0x10a031e3d <+320>: movq   0x27a1e4(%rip), %rsi      ; "raise:format:"
    0x10a031e44 <+327>: leaq   0x2a4dc5(%rip), %rcx      ; @"%@: nil argument"
    0x10a031e4b <+334>: xorl   %eax, %eax
    0x10a031e4d <+336>: movq   -0x58(%rbp), %rdi
    0x10a031e51 <+340>: movq   -0x60(%rbp), %rdx
    0x10a031e55 <+344>: movq   %rbx, %r8
    0x10a031e58 <+347>: callq  *0x294902(%rip)           ; (void *)0x000000010a4bb800: objc_msgSend
    0x10a031e5e <+353>: movq   0x10(%rbp), %r9
    0x10a031e62 <+357>: movq   -0x40(%rbp), %rcx
    0x10a031e66 <+361>: testb  $0x4, %ch
    0x10a031e69 <+364>: jne    0x10a031ebe               ; <+449>
    0x10a031e6b <+366>: movabsq $0x7fffffffffffffff, %rbx ; imm = 0x7FFFFFFFFFFFFFFF 
    0x10a031e75 <+376>: xorl   %edx, %edx
    0x10a031e77 <+378>: testq  %r15, %r15
    0x10a031e7a <+381>: je     0x10a031ede               ; <+481>
    0x10a031e7c <+383>: cmpq   $0x0, -0x50(%rbp)
    0x10a031e81 <+388>: je     0x10a031ede               ; <+481>
    0x10a031e83 <+390>: leaq   (,%rcx,8), %r8
    0x10a031e8b <+398>: notl   %r8d
    0x10a031e8e <+401>: andq   $0x10, %r8
    0x10a031e92 <+405>: orq    %rcx, %r8
    0x10a031e95 <+408>: leaq   -0x38(%rbp), %rax
    0x10a031e99 <+412>: movq   %rax, (%rsp)
    0x10a031e9d <+416>: movq   %r12, %rdi
    0x10a031ea0 <+419>: movq   %r13, %rsi
    0x10a031ea3 <+422>: movq   %r14, %rdx
    0x10a031ea6 <+425>: movq   %r15, %rcx
    0x10a031ea9 <+428>: callq  0x10a239354               ; symbol stub for: CFStringFindWithOptionsAndLocale
    0x10a031eae <+433>: xorl   %edx, %edx
    0x10a031eb0 <+435>: testb  %al, %al
    0x10a031eb2 <+437>: je     0x10a031ede               ; <+481>
    0x10a031eb4 <+439>: movq   -0x38(%rbp), %rbx
    0x10a031eb8 <+443>: movq   -0x30(%rbp), %rdx
    0x10a031ebc <+447>: jmp    0x10a031ede               ; <+481>
    0x10a031ebe <+449>: movq   0x27c873(%rip), %rsi      ; "_rangeOfRegularExpressionPattern:options:range:locale:"
    0x10a031ec5 <+456>: movq   %r9, (%rsp)
    0x10a031ec9 <+460>: movq   %r12, %rdi
    0x10a031ecc <+463>: movq   %r13, %rdx
    0x10a031ecf <+466>: movq   %r14, %r8
    0x10a031ed2 <+469>: movq   %r15, %r9
    0x10a031ed5 <+472>: callq  *0x294885(%rip)           ; (void *)0x000000010a4bb800: objc_msgSend
    0x10a031edb <+478>: movq   %rax, %rbx
    0x10a031ede <+481>: movq   %rbx, %rax
    0x10a031ee1 <+484>: addq   $0x48, %rsp
    0x10a031ee5 <+488>: popq   %rbx
    0x10a031ee6 <+489>: popq   %r12
    0x10a031ee8 <+491>: popq   %r13
    0x10a031eea <+493>: popq   %r14
    0x10a031eec <+495>: popq   %r15
    0x10a031eee <+497>: popq   %rbp
    0x10a031eef <+498>: retq   

它运行一个正则表达式来检查一个字符串是否有另一个字符串并且它有安全检查..

【讨论】:

【参考方案2】:

ios Foundation 是 Apple 的专有代码,因此您几乎不可能合法地看到 NSString 实现代码。

在你的情况下我会做的是:

为 [NSString containsString:] 函数(一个不错的字符串/模式对列表)构建并运行一些基准测试。

1234563结果。

【讨论】:

以上是关于ObjC中的[NSString containsString:]这个函数使用了啥算法?的主要内容,如果未能解决你的问题,请参考以下文章

ObjC / iOS:NSCFString 不是 NSString?

将 NSString 设置为等于另一个时的 Objc EXC_BAD_ACCESS

如何继承nsstring等类簇

在Selenium Webdriver中使用XPath ContainsSibling函数定位

在Selenium Webdriver中使用XPath ContainsSibling函数定位

在二维数组中搜索 tableView objC