Symptom:
In spark-shell, "msck repair" of a partition table named "database_name.table_name" fails with error "Expecting only one partition but more than one partitions are found.".Env:
Hive 2.1(with MySQL as the backend database for Hive Metastore)Spark 2.2.1
Troubleshooting:
The source code for this error message is :metastore/src/java/org/apache/hadoop/hive/metastore/ObjectStore.java
query = pm.newQuery(MPartition.class, "table.tableName == t1 && table.database.name == t2 && partitionName == t3"); query.declareParameters("java.lang.String t1, java.lang.String t2, java.lang.String t3"); mparts = (List<MPartition>) query.execute(tableName, dbName, name); pm.retrieveAll(mparts); commited = commitTransaction(); // We need to compare partition name with requested name since some DBs // (like MySQL, Derby) considers 'a' = 'a ' whereas others like (Postgres, // Oracle) doesn't exhibit this problem. if (mparts != null && mparts.size() > 0) { if (mparts.size() > 1) { throw new MetaException( "Expecting only one partition but more than one partitions are found."); }
Basically it will generate a query in MySQL(Hive Metastore backend database) to check if there are any duplicate entries based on Table Name, Database Name and Partition Name.
Then we can run below query in MySQL to find out the duplicate entries from PARTITIONS table for that specific Hive partition table -- database_name.table_name:
select p.PART_NAME,count(*) from TBLS t, PARTITIONS p, DBS d where d.DB_ID=t.DB_ID and p.TBL_ID=t.TBL_ID and t.TBL_NAME='table_name' and d.NAME='database_name' group by p.PART_NAME having count(*)>1;
If above query returns one partition name as "key=0", then it means there are 2 or more entries inside PARTITIONS table for that specific partition "key=0".
Then we need to list all the data for that partition:
select p.* from TBLS t, PARTITIONS p, DBS d where d.DB_ID=t.DB_ID and p.TBL_ID=t.TBL_ID and t.TBL_NAME='table_name' and d.NAME='database_name' and p.PART_NAME='key=0';For example, if above query returns two entries as below:
+---------+-------------+------------------+-----------+-------+--------+ | PART_ID | CREATE_TIME | LAST_ACCESS_TIME | PART_NAME | SD_ID | TBL_ID | +---------+-------------+------------------+-----------+-------+--------+ | 255 | 1536697318 | 0 | key=0 | 262 | 7 | | 256 | 1536697319 | 0 | key=0 | 263 | 7 | +---------+-------------+------------------+-----------+-------+--------+So in all, above results show there is already inconsistency such as duplicate entries in Hive Metastore backend database.
Root Cause:
For this specific case, we found that there were no constraints created for those Hive Metastore backend MySQL tables.Take "PARTITIONS" table for example, we can run "show create table PARTITIONS;" in MySQL and check if below 2 FOREIGN KEY CONSTRAINT exists or not:
CONSTRAINT `PARTITIONS_FK1` FOREIGN KEY (`TBL_ID`) REFERENCES `TBLS` (`TBL_ID`), CONSTRAINT `PARTITIONS_FK2` FOREIGN KEY (`SD_ID`) REFERENCES `SDS` (`SD_ID`)
Solution:
1. Short term fix
The safest way to clean the duplicate entries is to use Hive command instead of manually deleting entries in MySQL.We just need to drop and re-create that problematic Hive partition, and after that, those duplicate entries are cleaned.
For example:
ALTER TABLE database_name.table_name DROP PARTITION(key=0);After that, double confirm all the duplicate entries are gone in MySQL:
select p.PART_NAME,count(*) from TBLS t, PARTITIONS p, DBS d where d.DB_ID=t.DB_ID and p.TBL_ID=t.TBL_ID and t.TBL_NAME='table_name' and d.NAME='database_name' group by p.PART_NAME having count(*)>1;
Wow this blog is very nice …
ReplyDelete