Hey everyone,
currently working through the lendingclub dataset. My project is simply to predict whether a borrower will default using only the info available at the of the application.
Problem: I cannot figure out which features were available then and which would leak. I have poured over the data dict and found similar projects. There does not seem to be any consensus on which features do not leak the loan outcome.
I have rewritten my code multiple times and am out of ideas. Is there any reports or further info regarding this?
Thanks
submitted by /u/loblawslawcah
[link] [comments]