Description
The query redactor is redacting the query itself while the expression shown in lineage info is not, which may still expose sensitive info. The following query
select customers.id, customers.name from customers where customers.addresses['shipping'].zip_code ='1234-5678-1234-5678'; will have a log entry in lineage. The expression should also be redacted.
[HiveServer2-Background-Pool: Thread-43]: {"version":"1.0","user":"hive","timestamp":1510179280,"duration":40747,"jobIds":["job_1510150684172_0006"],"engine":"mr","database":"default","hash":"a2b4721a0935e3770d81649d24ab1cd4","queryText":"select customers.id, customers.name from customers where customers.addresses['shipping'].zip_code ='XXXX-XXXX-XXXX-XXXX'","edges":[{"sources":[2],"targets":[0],"edgeType":"PROJECTION"},{"sources":[3],"targets":[1],"edgeType":"PROJECTION"},{"sources":[],"targets":[0,1],"expression":"(addresses['shipping'].zip_code = '1234-5678-1234-5678')","edgeType":"PREDICATE"}],"vertices":[{"id":0,"vertexType":"COLUMN","vertexId":"customers.id"},{"id":1,"vertexType":"COLUMN","vertexId":"customers.name"},{"id":2,"vertexType":"COLUMN","vertexId":"default.customers.id"},{"id":3,"vertexType":"COLUMN","vertexId":"default.customers.name"}]}