Hey Andy ...dabal.clean should not remove missed value column, it might be a different pattern . It can tell missing value columns list and treat it separately. Other steps like constants and sparse categorical or ID info is fine . Some case Foreign key can be used for counts too as a feature. Looks messy if pandas profiling can be used with dabl package in junction to get a beautiful DF with all the details to be used or not used will help...what’s your view.
Missing values are not actually removed, they are encoded as a separate category. If you have ideas of how to integrate better with pandas profiling, I'm all ears!
Hey Andy ...dabal.clean should not remove missed value column, it might be a different pattern . It can tell missing value columns list and treat it separately. Other steps like constants and sparse categorical or ID info is fine . Some case Foreign key can be used for counts too as a feature. Looks messy if pandas profiling can be used with dabl package in junction to get a beautiful DF with all the details to be used or not used will help...what’s your view.
Missing values are not actually removed, they are encoded as a separate category. If you have ideas of how to integrate better with pandas profiling, I'm all ears!
Fun talk! Thanks Andy.