hadoop - What does the following fields: 'totalSize' and 'rawDataSize' mean in DESCRIBE EXTENDED query output in hive? -


if 1 runs describe extended command on hive table result presents totalsize , rawdatasize values near end of output.

what these fields mean?

ex:

hive > describe extended <tablename>  output results:  table(tablename:tablenamexxxxx, dbname:xxxxxx, ..........       ....................... numrows=116429472, totalsize=3835205544, rawdatasize=35040221600}) 

rawdatasize size of original data set, totalsize amount of storage takes. applicable orc file format, compresses data totalsize lesser rawdatasize.


Comments

Popular posts from this blog

how to insert data php javascript mysql with multiple array session 2 -

multithreading - Exception in Application constructor -

windows - CertCreateCertificateContext returns CRYPT_E_ASN1_BADTAG / 8009310b -