HBase: Viewing HFile Files

Posted by ilifeilong

Contents of the emp table

hbase(main):098:0> scan 'emp'
ROW                                                              COLUMN+CELL                                                                                                                                                                                 
 row1                                                            column=mycf:depart, timestamp=1555846776542, value=research                                                                                                                                 
 row1                                                            column=mycf:id, timestamp=1555846776590, value=7876                                                                                                                                         
 row1                                                            column=mycf:job, timestamp=1555846776566, value=clerk                                                                                                                                       
 row1                                                            column=mycf:locate, timestamp=1555846776618, value=dallas                                                                                                                                   
 row1                                                            column=mycf:name, timestamp=1555846776511, value=adams                                                                                                                                      
 row2                                                            column=mycf:depart, timestamp=1555846776687, value=sales                                                                                                                                    
 row2                                                            column=mycf:id, timestamp=1555846776736, value=7499                                                                                                                                         
 row2                                                            column=mycf:job, timestamp=1555846776712, value=salesman                                                                                                                                    
 row2                                                            column=mycf:locate, timestamp=1555846776770, value=chicago                                                                                                                                  
 row2                                                            column=mycf:name, timestamp=1555846776662, value=allen                                                                                                                                      
 row3                                                            column=mycf:depart, timestamp=1555846776838, value=sales                                                                                                                                    
 row3                                                            column=mycf:id, timestamp=1555846776887, value=7698                                                                                                                                         
 row3                                                            column=mycf:job, timestamp=1555846776863, value=manager                                                                                                                                     
 row3                                                            column=mycf:locate, timestamp=1555846776912, value=chicago                                                                                                                                  
 row3                                                            column=mycf:name, timestamp=1555846776806, value=blake                                                                                                                                      
 row4                                                            column=mycf:depart, timestamp=1555846776976, value=accounting                                                                                                                               
 row4                                                            column=mycf:id, timestamp=1555846777027, value=7782                                                                                                                                         
 row4                                                            column=mycf:job, timestamp=1555846777002, value=manager                                                                                                                                     
 row4                                                            column=mycf:locate, timestamp=1555846777086, value=new york                                                                                                                                 
 row4                                                            column=mycf:name, timestamp=1555846776952, value=clark                                                                                                                                      
 row5                                                            column=mycf:depart, timestamp=1555846777146, value=research                                                                                                                                 
 row5                                                            column=mycf:id, timestamp=1555846777193, value=7902                                                                                                                                         
 row5                                                            column=mycf:job, timestamp=1555846777169, value=analyst                                                                                                                                     
 row5                                                            column=mycf:locate, timestamp=1555846777218, value=dallas                                                                                                                                   
 row5                                                            column=mycf:name, timestamp=1555846777121, value=ford                                                                                                                                       
 row6                                                            column=mycf:depart, timestamp=1555846777277, value=sales                                                                                                                                    
 row6                                                            column=mycf:id, timestamp=1555846777324, value=7900                                                                                                                                         
 row6                                                            column=mycf:job, timestamp=1555846777301, value=clerk                                                                                                                                       
 row6                                                            column=mycf:locate, timestamp=1555846777355, value=chicago                                                                                                                                  
 row6                                                            column=mycf:name, timestamp=1555846777253, value=james                                                                                                                                      
 row7                                                            column=mycf:depart, timestamp=1555846777416, value=research                                                                                                                                 
 row7                                                            column=mycf:id, timestamp=1555846777465, value=7566                                                                                                                                         
 row7                                                            column=mycf:job, timestamp=1555846777441, value=manager                                                                                                                                     
 row7                                                            column=mycf:locate, timestamp=1555846777491, value=dallas                                                                                                                                   
 row7                                                            column=mycf:name, timestamp=1555846777390, value=jones                                                                                                                                      
 row8                                                            column=mycf:depart, timestamp=1555846777556, value=accounting                                                                                                                               
 row8                                                            column=mycf:id, timestamp=1555846777604, value=7839                                                                                                                                         
 row8                                                            column=mycf:job, timestamp=1555846777581, value=president                                                                                                                                   
 row8                                                            column=mycf:locate, timestamp=1555846777628, value=new york                                                                                                                                 
 row8                                                            column=mycf:name, timestamp=1555846777526, value=king                                                                                                                                       
8 row(s) in 0.0490 seconds

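Tool

HBase ships with a command-line HFile inspector, the class org.apache.hadoop.hbase.io.hfile.HFile. Running it without arguments prints its usage: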

# hbase org.apache.hadoop.hbase.io.hfile.HFile
usage: HFile [-a] [-b] [-e] [-f <arg>] [-k] [-m] [-p] [-r <arg>] [-s] [-v]
 -a,--checkfamily    Enable family check
 -b,--printblocks    Print block index meta data
 -e,--printkey       Print keys
 -f,--file <arg>     File to scan. Pass full-path; e.g.
                     hdfs://a:9000/hbase/.META./12/34
 -k,--checkrow       Enable row order check; looks for out-of-order keys
 -m,--printmeta      Print meta data of file
 -p,--printkv        Print key/value pairs
 -r,--region <arg>   Region to scan. Pass region name; e.g. .META.,,1
 -s,--stats          Print statistics
 -v,--verbose        Verbose output; emits file and meta data delimiters
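
The flags listed above can be combined in one call. On a large file it is usually worth starting with -m alone before dumping every cell with -p. As a minimal sketch, the wrapper below builds the command line from a path and a set of flags. The names hfile_tool and DRY_RUN are made up for this illustration and are not part of HBase. With DRY_RUN=1 it only prints the command, so it can be tried without a cluster.

```shell
#!/bin/sh
# hfile_tool PATH [FLAGS...]
# Hypothetical helper (not part of HBase): runs the HFile tool on PATH with
# the given flags, or just prints the command line when DRY_RUN=1.
hfile_tool() {
  hfile_path="$1"
  shift
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "hbase org.apache.hadoop.hbase.io.hfile.HFile -f $hfile_path $*"
  else
    hbase org.apache.hadoop.hbase.io.hfile.HFile -f "$hfile_path" "$@"
  fi
}

# Metadata only, without the per-cell dump (printed here, not executed):
DRY_RUN=1 hfile_tool /hbase/emp/2dddf0f7140e120718b6d4356dfcee85/mycf/cab01eb30627452e8e38defad2144996 -m
```

Unsetting DRY_RUN makes the same call run the real tool, which requires the hbase launcher on the PATH and read access to the file in HDFS.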

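Example: dump one HFile of the mycf column family of emp, printing keys (-e), key/value pairs (-p), metadata (-m), and statistics (-s):

# hbase org.apache.hadoop.hbase.io.hfile.HFile -f /hbase/emp/2dddf0f7140e120718b6d4356dfcee85/mycf/cab01eb30627452e8e38defad2144996 -e -p -m -s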
19/05/10 21:39:27 INFO hfile.CacheConfig: Allocating LruBlockCache with maximum size 511.0m
K: row1/mycf:depart/1555846776542/Put/vlen=8 V: research
K: row1/mycf:id/1555846776590/Put/vlen=4 V: 7876
K: row1/mycf:job/1555846776566/Put/vlen=5 V: clerk
K: row1/mycf:locate/1555846776618/Put/vlen=6 V: dallas
K: row1/mycf:name/1555846776511/Put/vlen=5 V: adams
K: row2/mycf:depart/1555846776687/Put/vlen=5 V: sales
K: row2/mycf:id/1555846776736/Put/vlen=4 V: 7499
K: row2/mycf:job/1555846776712/Put/vlen=8 V: salesman
K: row2/mycf:locate/1555846776770/Put/vlen=7 V: chicago
K: row2/mycf:name/1555846776662/Put/vlen=5 V: allen
K: row3/mycf:depart/1555846776838/Put/vlen=5 V: sales
K: row3/mycf:id/1555846776887/Put/vlen=4 V: 7698
K: row3/mycf:job/1555846776863/Put/vlen=7 V: manager
K: row3/mycf:locate/1555846776912/Put/vlen=7 V: chicago
K: row3/mycf:name/1555846776806/Put/vlen=5 V: blake
K: row4/mycf:depart/1555846776976/Put/vlen=10 V: accounting
K: row4/mycf:id/1555846777027/Put/vlen=4 V: 7782
K: row4/mycf:job/1555846777002/Put/vlen=7 V: manager
K: row4/mycf:locate/1555846777086/Put/vlen=8 V: new york
K: row4/mycf:name/1555846776952/Put/vlen=5 V: clark
K: row5/mycf:depart/1555846777146/Put/vlen=8 V: research
K: row5/mycf:id/1555846777193/Put/vlen=4 V: 7902
K: row5/mycf:job/1555846777169/Put/vlen=7 V: analyst
K: row5/mycf:locate/1555846777218/Put/vlen=6 V: dallas
K: row5/mycf:name/1555846777121/Put/vlen=4 V: ford
K: row6/mycf:depart/1555846777277/Put/vlen=5 V: sales
K: row6/mycf:id/1555846777324/Put/vlen=4 V: 7900
K: row6/mycf:job/1555846777301/Put/vlen=5 V: clerk
K: row6/mycf:locate/1555846777355/Put/vlen=7 V: chicago
K: row6/mycf:name/1555846777253/Put/vlen=5 V: james
K: row7/mycf:depart/1555846777416/Put/vlen=8 V: research
K: row7/mycf:id/1555846777465/Put/vlen=4 V: 7566
K: row7/mycf:job/1555846777441/Put/vlen=7 V: manager
K: row7/mycf:locate/1555846777491/Put/vlen=6 V: dallas
K: row7/mycf:name/1555846777390/Put/vlen=5 V: jones
K: row8/mycf:depart/1555846777556/Put/vlen=10 V: accounting
K: row8/mycf:id/1555846777604/Put/vlen=4 V: 7839
K: row8/mycf:job/1555846777581/Put/vlen=9 V: president
K: row8/mycf:locate/1555846777628/Put/vlen=8 V: new york
K: row8/mycf:name/1555846777526/Put/vlen=4 V: king
Block index size as per heapsize: 416
reader=/hbase/emp/2dddf0f7140e120718b6d4356dfcee85/mycf/cab01eb30627452e8e38defad2144996,
    compression=none,
    cacheConf=CacheConfig:enabled [cacheDataOnRead=true] [cacheDataOnWrite=false] [cacheIndexesOnWrite=false] [cacheBloomsOnWrite=false] [cacheEvictOnClose=false] [cacheCompressed=false],
    firstKey=row1/mycf:depart/1555846776542/Put,
    lastKey=row8/mycf:name/1555846777526/Put,
    avgKeyLen=24,
    avgValueLen=5,
    entries=40,
    length=2155
Trailer:
    fileinfoOffset=1678,
    loadOnOpenDataOffset=1591,
    dataIndexCount=1,
    metaIndexCount=0,
    totalUncomressedBytes=2092,
    entryCount=40,
    compressionCodec=NONE,
    uncompressedDataIndexSize=39,
    numDataIndexLevels=1,
    firstDataBlockOffset=0,
    lastDataBlockOffset=0,
    comparatorClassName=org.apache.hadoop.hbase.KeyValue$KeyComparator,
    version=2
Fileinfo:
    KEY_VALUE_VERSION = \x00\x00\x00\x01
    MAJOR_COMPACTION_KEY = \x00
    MAX_MEMSTORE_TS_KEY = \x00\x00\x00\x00\x00\x00\x00\x00
    MAX_SEQ_ID_KEY = 7099
    TIMERANGE = 1555846776511....1555846777628
    hfile.AVG_KEY_LEN = 24
    hfile.AVG_VALUE_LEN = 5
    hfile.LASTKEY = \x00\x04row8\x04mycfname\x00\x00\x01j?\xB1\xCA\xB6\x04
Mid-key: \x00\x04row1\x04mycfdepart\x00\x00\x01j?\xB1\xC6\xDE\x04
Bloom filter:
    Not present
Stats:
Key length: count: 40    min: 22    max: 26    mean: 24.2
Val length: count: 40    min: 4    max: 10    mean: 5.975
Row size (bytes): count: 8    min: 187    max: 196    mean: 190.875
Row size (columns): count: 8    min: 5    max: 5    mean: 5.0
Key of biggest row: row8
Scanned kv count -> 40
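
The Stats section and the escaped hfile.LASTKEY value can both be re-derived from the printed cells, which is a useful check of how a cell is laid out in the file. The sketch below assumes the classic untagged KeyValue layout described in its comments. It also assumes ASCII row keys, families and qualifiers that contain no "/", ":" or spaces. The function names summarize_dump and hex_ts are made up for this illustration.

```shell
#!/bin/sh
# Assumed KeyValue layout (classic, untagged):
#   4-byte key length | 4-byte value length | key | value
#   key = 2-byte row length | row | 1-byte family length | family
#         | qualifier | 8-byte timestamp | 1-byte type
# Hence: key length = 12 + row + family + qualifier bytes,
#        cell size  = 8 + key length + value length.

# summarize_dump: reads the "K: ... V: ..." lines of the -p output on stdin
# and recomputes the figures shown under "Stats:".
summarize_dump() {
  awk '
    /^K: / {
      # $2 looks like row/family:qualifier/timestamp/type/vlen=N
      split($2, k, "/")
      split(k[2], col, ":")
      sub(/^vlen=/, "", k[5])
      keylen = 12 + length(k[1]) + length(col[1]) + length(col[2])
      cells++
      keysum += keylen
      valsum += k[5]
      rowsize[k[1]] += 8 + keylen + k[5]
    }
    END {
      if (cells == 0) exit
      for (r in rowsize) {
        rows++
        if (min == "" || rowsize[r] < min) min = rowsize[r]
        if (rowsize[r] > max) max = rowsize[r]
      }
      printf "cells=%d rows=%d meanKey=%.1f meanVal=%.3f rowMin=%d rowMax=%d\n",
             cells, rows, keysum / cells, valsum / cells, min, max
    }'
}

# hex_ts: converts the 8 timestamp bytes of a key, written as hex digits,
# to a decimal timestamp in milliseconds.
hex_ts() {
  printf '%d\n' "0x$1"
}

# Demo on the five row1 cells from the dump above.
summarize_dump <<'EOF'
K: row1/mycf:depart/1555846776542/Put/vlen=8 V: research
K: row1/mycf:id/1555846776590/Put/vlen=4 V: 7876
K: row1/mycf:job/1555846776566/Put/vlen=5 V: clerk
K: row1/mycf:locate/1555846776618/Put/vlen=6 V: dallas
K: row1/mycf:name/1555846776511/Put/vlen=5 V: adams
EOF

# hfile.LASTKEY ends with \x00\x00\x01j?\xB1\xCA\xB6 followed by \x04 (Put).
# Printable bytes are shown as ASCII: j is 0x6A and ? is 0x3F.
hex_ts 0000016A3FB1CAB6
```

The second call prints 1555846777526, the timestamp of row8/mycf:name shown as lastKey. Piping all 40 K: lines of the dump into summarize_dump should reproduce the Stats values: mean key length 24.2, mean value length 5.975, and row sizes between 187 and 196 bytes.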

 
