hadoop - What do the 'totalSize' and 'rawDataSize' fields mean in the DESCRIBE EXTENDED query output in Hive?
If one runs the DESCRIBE EXTENDED command on a Hive table, the result presents totalSize and rawDataSize values near the end of the output.
What do these fields mean?
Example:
hive> describe extended <tablename>
Output results: Table(tableName:tablenamexxxxx, dbName:xxxxxx, .......... ....................... numRows=116429472, totalSize=3835205544, rawDataSize=35040221600})
rawDataSize is the size of the original data set; totalSize is the amount of storage it takes. This is applicable to the ORC file format, which compresses the data, so totalSize is less than rawDataSize.
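To make the relationship concrete, here is a small sketch that plugs in the figures from the example output and works out how much smaller the stored table is than the raw data. It assumes both sizes are reported in bytes; the function name `compression_ratio` is made up for illustration and is not part of Hive.

```python
# Values copied from the DESCRIBE EXTENDED example output.
# Assumption: both sizes are in bytes.
num_rows = 116429472
total_size = 3835205544       # storage the table takes
raw_data_size = 35040221600   # size of the original data set


def compression_ratio(raw_bytes: int, stored_bytes: int) -> float:
    """Return how many raw bytes each stored byte represents."""
    if stored_bytes <= 0:
        raise ValueError("stored_bytes must be positive")
    return raw_bytes / stored_bytes


ratio = compression_ratio(raw_data_size, total_size)
raw_per_row = raw_data_size / num_rows
stored_per_row = total_size / num_rows

print(f"compression ratio:    {ratio:.2f}x")
print(f"raw bytes per row:    {raw_per_row:.1f}")
print(f"stored bytes per row: {stored_per_row:.1f}")
```

For this table the raw data is roughly nine times larger than what is stored, which fits the explanation that the stored files are compressed.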