Explainable AI explained! | #3 LIME

  • Published Sep 20, 2024

Comments • 85

  • @sakshamgulati1578 3 years ago +24

    One of the best and wholesome explanations on this topic! Kudos!

  • @yangbai657 2 years ago +6

    The objective function part is amazingly well explained.

  • @arturocdb 1 year ago +1

    I just want to congratulate you! You are the only one on YouTube with a solid background (books and papers) in what you teach in a video. Thank you so much, your videos helped me a lot!

  • @briggstwitchell1654 7 months ago +1

    Wow, very clear and concise explanation of a fairly complex technique. Thank you very much for the effort you put into this. Very helpful!

  • @dvabhinav31 5 months ago

    Wow! Such a cool explanation. I had some intuition issues after reading the paper that are now solved due to this video. Thanks a lot!

  • @techjack6307 2 years ago +3

    Concise and great explanation on the topic.

  • @보라색사과-l1r 3 years ago +3

    Great work! I'm lucky to have found this channel, as my major interests are GNNs and XAI. Thank you so much for the brilliant work.

  • @hqi7573 3 years ago +2

    Really cool video. Love the format that mixes theory and coding. Just one point for discussion: we need to measure recall even for a binary classifier or for balanced data. It measures a classifier's ability to find all positive examples, while precision measures how reliable the positive predictions are. So they measure two different types of errors; in statistics we talk about Type I and Type II errors.

    • @DeepFindr 3 years ago

      Thanks!
      Yes you are right. Ideally it makes sense to look at all the metrics that are commonly used :) ROC, accuracy, precision, recall,...

  • @abhirama 2 years ago +1

    Amazingly clear explanation.

  • @ngwoonyee8001 3 months ago

    thank you so much for the splendid explanation!

  • @SuperMaker.M 4 months ago

    Finally a channel with good content!

  • @ayaeldeeb19 9 months ago

    Really amazing!! It's very exciting and cool to conduct such research; it is also very explainable.

  • @borauyar 3 years ago +3

    13:20 I believe "edu" has something to do with "atheism" :) Thanks for the explanations!

    • @DeepFindr 3 years ago

      Haha yes might be ;-)

  • @youssefbenhachem993 1 year ago

    It is just clearly explained! Thanks a lot

  • @ramalaccX 2 years ago

    Damn bro, this helped me a lot with my final thesis

    • @DeepFindr 2 years ago +1

      That's great :) good luck with it!

  • @nintishia 3 years ago

    Excellent explanation of LIME.

  • @akshaybareja4697 2 years ago

    Beautifully explained

  • @joaquinecc 5 months ago

    Amazing explanation! Thank you

  • @MoMo-ib9ej 1 year ago

    You're very helpful! Quick questions from a relatively non-tech person:
    1) How does this scale for models with millions of params, e.g. could you even use this on an LLM?
    2) Am I right in understanding that you need a mini real dataset to train locally every time then?

    • @DeepFindr 1 year ago +1

      Hehe thanks :)
      1) I'm not sure if LIME is the right approach for LLMs. As these models mainly operate on numeric vectors it's difficult to fit a simple interpretable model on these. Also there are no real features as everything is very high-dimensional. One easy approach is to look at the attention weights. This allows you to understand which words the model used and where it "looked at" for some output. I found a nice visualization of that in this blog post: mohitmayank.medium.com/explainable-ai-language-models-b4b75f56bfe2
      2) Yes for every prediction a small dataset is sampled to train the surrogate model

  • @Anirudh-cf3oc 2 years ago

    Thank you for the amazing explanation

  • @parteekbhatia 2 years ago

    Hi DeepFindr,
    First of all, sorry for hurting your sentiments as a developer. I actually created the video to share within a closed group as part of a series of video sessions, and in those sessions I properly cited your work and mentioned the source. But I mistakenly set the visibility of this video to public without noticing that it didn't reference any source. I feel sorry for this mistake.
    I have removed the video from the public; however, I have added the references and sources, and moreover I will edit it and include them in the video itself.
    I am a great fan of yours and love the way you have explained XAI, and in no way did I want to hurt my Guru. I again feel sorry for my unintended action, and thanks for pointing it out and correcting me.
    Keep doing good work.
    Wishing you all the best,

    • @DeepFindr 2 years ago

      Hey! No worries, it's all fine :)
      At first I was a bit surprised, but then I thought it's not really a problem, as your goal is to share knowledge and that's what it's all about. You can keep everything public as you wish.
      Thanks for your kind words! Wish you all the best as well!

  • @menensa 3 years ago +1

    Fantastic !!!

  • @shalombentenzingnamchyo9384 1 year ago

    wow 💯 Great explanation

  • @heejuneAhn 10 months ago

    LIME can explain how the complex model behaves around the sample point of interest, but it cannot guarantee that the complex model's predictions around the sample are correct. Am I right? Also, could we use LIME to explain an incorrect prediction of the complex model? For example, we could take two samples, one with a correct prediction and one with an incorrect one, then build a simple surrogate model around the two samples so that we can see how the complex model predicts and what is wrong with it. Am I right again?

  • @randalllionelkharkrang4047 2 years ago

    Thanks for this. It's funny that the "ever_married" feature is inversely proportional to getting a stroke :)

  • @heejuneAhn 10 months ago

    I think in the last example you cannot say whether the model is wrong or not. LIME just explains how the model behaves near the sample. If you could explain it so easily, why would you use a complex model? You can only say so using the ground truth. Some features that you might think are not logical might still affect that specific local point.

  • @LittleBigAmbition 2 years ago

    Hi, amazing explanation.
    I'm wondering if you could elaborate on how the size of the neighborhood is decided? How do we know how "large" the neighborhood is for the newly fit linear model? Realistically, we could just choose a few close points and fit a linear model there, but that might not be the best fit? I hope my question makes sense.

  • @heejuneAhn 10 months ago

    When a feature is not continuous, how do we sample the neighborhood? For example, for binary features: just combinations of features, or treating the binary features as continuous ones?

  • @sheetalrajpal4436 2 years ago

    Concepts explained "PERFECTLY" !! Do we have slides of these videos?

    • @DeepFindr 2 years ago +1

      Hi :) thanks!
      I haven't uploaded them but you can send a mail to deepfindr@gmail.com and I will attach them :)

    • @sheetalrajpal4436 2 years ago

      @@DeepFindr Thank you so much!

  • @hasnainmamdani4534 2 years ago

    Great explanation! Thanks! Quick question: I was wondering what the reason is behind providing LIME with the training dataset (line 29); what does it do with it? I understood that the testing dataset (lines 32/33) is used to do perturbations and create a simpler model? Thanks!

    • @DeepFindr 2 years ago +1

      Hi! This is used to calculate feature statistics when the parameter discretize_continuous is passed. Otherwise it's simply ignored.
      You can have a look here to see how it is used in detail:
      github.com/marcotcr/lime/blob/master/lime/lime_tabular.py
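
As a rough NumPy-only sketch of the reply above (hypothetical data, not the lime package's internal code): per-feature means and standard deviations are computed once from the training set, and the perturbed neighborhood is then drawn from those statistics.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical training data: 200 rows, 3 tabular features
X_train = rng.normal(loc=[40.0, 1.0, 25.0], scale=[10.0, 0.5, 5.0], size=(200, 3))

# LIME-style feature statistics from the training set
mu = X_train.mean(axis=0)
sigma = X_train.std(axis=0)

# Perturbed neighborhood: standard-normal noise rescaled by those statistics
noise = rng.standard_normal(size=(500, 3))
Z = noise * sigma + mu

print(Z.shape)  # (500, 3)
```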

  • @heejuneAhn 10 months ago

    Why do you use different z and z' for f(z) and g(z')? I cannot get it.

  • @moawiaserdar4143 2 years ago

    Thank you for the amazing explanation!
    Have you by any chance encountered research on XAI in industrial applications, such as threaded fastening?

    • @DeepFindr 2 years ago

      Hi! I think there is a lot of potential for XAI in industry. Some years ago I did a project in materials science. A company produced products to damp vibration and squealing. Basically they wanted to know which material combinations lead to a certain behavior and why. But I have no publication for that, for two reasons:
      - I used counterfactual explanations and had my own implementation but didn't really contribute to / extend current research
      - Most companies want to keep such advances secret to have a competitive advantage
      And I think these are the reasons why there is generally not too much available in that direction. There are some meta papers, like this one
      arxiv.org/abs/2106.07758
      but it's difficult to find very specific applications.
      Best regards

  • @andreashorlbeck8749 2 years ago

    perfect!

  • @bryanparis7779 1 year ago

    Does LIME use permutation or perturbation (tweaking feature values) of the input of interest (x) that we would like to explain?

  • @maegancalvert59 2 years ago

    Wonderful explanation!!! Thank you!!!

  • @durgadeviarulrajan4560 2 years ago

    Hi, thanks for this wonderful video. I have a question: the perturbed dataset created by LIME is like a fake dataset, right? How do those data points get predictions or target variables? Does it mean the black-box model is trained again on these perturbed data points to get the predictions?

    • @DeepFindr 2 years ago

      Hi! The target variables of this fake dataset come from the "complex" black box model. This means we simply put these points into the Blackbox and use the predictions as labels. Then we fit a new simple model with this new dataset :)
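
As a sketch of that answer (a stand-in black box on hypothetical data, not the lime package itself): the perturbed points are labeled by the black box's own predictions, and the simple surrogate is fitted on those labels, never on any ground truth.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)

# Stand-in "complex" black-box model trained on hypothetical data
X_train = rng.normal(size=(300, 4))
y_train = (X_train[:, 0] + X_train[:, 1] ** 2 > 0).astype(int)
blackbox = RandomForestClassifier(random_state=0).fit(X_train, y_train)

# Point to explain and its perturbed ("fake") neighborhood
x = X_train[0]
Z = x + 0.3 * rng.standard_normal(size=(500, 4))

# The labels come from the black box, not from any ground truth
y_fake = blackbox.predict_proba(Z)[:, 1]

# Fit the simple surrogate on the fake dataset
surrogate = Ridge().fit(Z, y_fake)
print(surrogate.coef_)  # local feature contributions
```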

  • @kalpaashhar6522 3 years ago

    Hey, thanks for the clean explanation! I wanted to know if you have tried LIME with a multi-class classification problem, aside from the binary classification already in the video? Thanks

    • @DeepFindr 3 years ago +1

      Hi!
      I haven't tried it before, but maybe this website is useful for you:
      coderzcolumn.com/tutorials/machine-learning/how-to-use-lime-to-understand-sklearn-models-predictions#lime_tabular_ex3
      They have an example for a multiclass classification case :)
      Thanks!

    • @kalpaashhar6522 3 years ago

      @@DeepFindr thank you!!

  • @Lukaas_95 3 years ago

    Thank you very much for this excellent explanation. One short question: What is the difference between z and z'?

    • @DeepFindr 3 years ago +1

      Thank you!
      z' is a perturbed sample and z is the original representation.
      Thanks for pointing this out, I somehow missed it when creating this video.
      z is then the full sample (passed to the black box) and z' contains only a fraction of the non-zero elements and goes to the surrogate model.
      The idea is to get simpler data points that make it easier to explain predictions.

    • @Lukaas_95 3 years ago

      @@DeepFindr Thank you for your very fast answer! So:
      Instance to be explained --> x
      Random perturbations based on x --> Z
      A sample of the random perturbations Z --> z'
      Am I right?...

    • @DeepFindr 3 years ago +1

      Hi, yes, more or less. For the last relation I would maybe call it a "simplified version" of z.
      So x is the original data point, x' its simplified version.
      Therefore z is a perturbed sample and z' its simplified version.
      For images, for example, they use superpixels as the interpretable (simplified) version. For text data they use a (sampled) bag-of-words representation.
      But to get these simplified versions they sample again, as you said :)

    • @Lukaas_95 3 years ago

      @@DeepFindr Thank you! :-)
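
A tiny illustration of the x/x'/z/z' notation from the thread above, for text (hypothetical sentence; superpixels play the same role for images): x' and z' are binary word-presence vectors, and z is the text that a perturbed z' maps back to.

```python
# x is the original sentence, x' its interpretable (binary word-presence)
# version. A perturbed z' switches some words off; z is the text that z'
# maps back to, which is what gets fed to the black box.
words = ["this", "movie", "was", "great"]   # interpretable features
x_prime = [1, 1, 1, 1]                      # all words present
z_prime = [1, 1, 0, 1]                      # perturbed: "was" removed
z = " ".join(w for w, keep in zip(words, z_prime) if keep)
print(z)  # "this movie great"
```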

  • @nitinvats3165 3 years ago

    Really great video. I am trying to implement LIME for one of our use cases, and the feature contributions change on every run for the same input transaction. Is that the expected behavior? Example: for property price prediction, property age contributes 25%, and when running the explainability again without modifying anything, the contribution changes to 28%. How can we control this?

    • @DeepFindr 3 years ago

      Hi! Thanks :)
      Mhh because the sampling of new data points includes some randomness, it might be the case that this affects the contribution. It's basically like each time a different dataset is used to calculate the contributions.
      You could try to set a random seed and always use the same one.
      You can read more about it here: github.com/marcotcr/lime/issues/119
      Best regards :)

    • @nitinvats3165 3 years ago

      @@DeepFindr Thank you. This helps. I will try to set the random seed.
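
A minimal sketch of that suggestion in plain NumPy (the lime package's LimeTabularExplainer also accepts a random_state argument for the same purpose): with a fixed seed the sampled neighborhood is identical on every run, so the contributions stop drifting.

```python
import numpy as np

def perturb(x, n, seed):
    # Fixing the seed makes the sampled neighborhood, and hence the
    # surrogate's feature contributions, reproducible across runs.
    rng = np.random.default_rng(seed)
    return x + rng.standard_normal(size=(n, x.shape[0]))

x = np.array([40.0, 1.0, 25.0])
Z1 = perturb(x, 100, seed=42)
Z2 = perturb(x, 100, seed=42)
print(np.array_equal(Z1, Z2))  # True
```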

  • @kenalexandremeridi 27 days ago

    What is this for? I am intrigued (I'm a musician)

  • @mikehawk6690 2 years ago

    nice video

  • @DanielWeikert 3 years ago

    So when we deal with classification, is the local model for LIME then a logistic regression (linear regression would only work for regression problems, right)? X would be the features from the nearest points to the one we want to explain. Would y be the original target (so stroke or no stroke in this case)?
    Thanks and great video.

    • @DeepFindr 3 years ago +1

      Hi! Yes, for a classification task a logistic regression would be fitted, for instance. X are the features of the input data point we want to explain. Out of those we would generate a neighborhood with similar features X'.
      y is the original target, yes, but it is not used by the algorithm, because at prediction time we don't have that information.
      In the code I pass y_test to the interpretml library to additionally show the actual labels.
      Thanks!

    • @DanielWeikert 3 years ago

      @@DeepFindr So we would not use samples from the training set which are close as neighbors, but instead create artificial new ones?
      In order to get the betas and therefore the importance, how do we fit the logistic regression?
      My first understanding was: use close training samples, fit the log reg on those and the training y's, and then derive the beta/importance only from those.
      BR

    • @DeepFindr 3 years ago +1

      @@DanielWeikert Hi, I guess my wording was a bit confusing here. Yes, the training samples are used and weighted according to their proximity. However, we do not take their (ground truth) labels y. Our labels come from the black-box model (the predictions on these data points), because we want to approximate it.
      But that also means we are not limited to the training data; we could also generate new data points by small perturbations / sampling from distributions, pipe them through the black box, and fit a simple model based on these labels.
      Hope that makes sense :)

    • @DanielWeikert 3 years ago

      @@DeepFindr Thanks a lot for your reply. Highly appreciated. What I still don't understand is how the blackbox model is fitted. So are the y values the original predictions on these training samples instead of the ground truth y_train?

    • @DeepFindr 3 years ago +1

      @@DanielWeikert sure no problem :)
      The black-box model is fitted on X_train with the labels y_train.
      Now we have a specific data input to our black-box model, for which the prediction is, for example, 0. In order to explain this prediction we want to build a simple model in the neighborhood of that point, based on the decision boundary of the black box.
      The simple model is fitted on sampled similar data points (for example coming from the train data X_train); let's call them Z_train.
      The labels for these samples come from the black-box model f, so we get y_train_simple = f(Z_train).
      The black-box model is only fitted once, at an early point in time. For the explanations of the predictions we fit a new simple model each time.
      Best
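
The thread above can be sketched end to end (a stand-in black box f and hypothetical numbers, not the exact LIME implementation): sample Z_train around the instance, label it with f, weight each sample by its proximity, and fit the simple model with those weights.

```python
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)

# Stand-in black box f: returns a probability-like score
f = lambda Z: 1 / (1 + np.exp(-(Z[:, 0] - 2 * Z[:, 2])))

x = np.array([0.5, -1.0, 0.2])                     # instance being explained
Z_train = x + rng.standard_normal(size=(1000, 3))  # sampled neighbors
y_simple = f(Z_train)                              # labels from the black box

# Exponential kernel: closer samples get higher weight
dist = np.linalg.norm(Z_train - x, axis=1)
weights = np.exp(-(dist ** 2) / (2 * 0.75 ** 2))

surrogate = Ridge().fit(Z_train, y_simple, sample_weight=weights)
print(surrogate.coef_)  # local importance of each feature
```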

  • @muhammadlimangambo589 2 years ago

    Thank you for the wonderful step-by-step explanation; it really goes a long way. However, implementing the same explanation procedure on a Conv1D CNN model for binary classification throws the error "AttributeError: 'Sequential' object has no attribute 'predict_proba'". Please, is there any replacement for the predict_proba method?

    • @DeepFindr 2 years ago

      Hi, have you tried the recurrent Explainer as suggested here: github.com/marcotcr/lime/issues/426

    • @muhammadlimangambo589 2 years ago

      No, just seeing it. I'll give it a try.
      Thank you for the timely response.
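
One common workaround for the missing predict_proba (a sketch, assuming the Sequential model's predict() returns one sigmoid probability per row; FakeModel below is a hypothetical stand-in for such a model) is to hand LIME a function that builds the two-column class-probability array itself:

```python
import numpy as np

def predict_proba_fn(model):
    # Wraps a binary classifier whose predict() returns a single sigmoid
    # probability per row, producing the (n, 2) array of class
    # probabilities that LIME expects from predict_proba.
    def fn(X):
        p = np.asarray(model.predict(X)).reshape(-1, 1)
        return np.hstack([1.0 - p, p])
    return fn

# Hypothetical stand-in for a Keras Sequential model's predict():
class FakeModel:
    def predict(self, X):
        return 1 / (1 + np.exp(-np.asarray(X).sum(axis=1)))

probs = predict_proba_fn(FakeModel())(np.array([[0.0, 0.0], [2.0, 2.0]]))
print(probs[0])  # [0.5 0.5]
```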

  • @sanjanasuresh5565 1 year ago

    Can Lime be used for unsupervised ML algorithms?

  • @rafsanjany2936 2 years ago

    Hello there, I got an error like "ufunc 'subtract' did not contain a loop with signature matching types (dtype('

    • @DeepFindr 2 years ago

      I think something might be wrong with the columns in your dataframe. More specifically, I would double-check the column types and cast all float64 to float32 using the to_numeric function.
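
A small pandas sketch of that fix (hypothetical dataframe; to_numeric coerces object/string columns to numbers, while astype settles on one float width): make every column numeric before handing the array to LIME, since mixed object dtypes are what make NumPy's 'subtract' ufunc fail.

```python
import numpy as np
import pandas as pd

# Hypothetical dataframe: "age" was read in as strings (object dtype)
df = pd.DataFrame({"age": ["40", "38", "41"], "bmi": [22.5, 30.1, 27.3]})

# Coerce everything to numeric, then settle on a single float width
df = df.apply(pd.to_numeric, errors="coerce").astype(np.float32)
print(df.dtypes)  # all columns are now float32
```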

  • @bigbangize 3 years ago

    Hi! Can LIME be used for Unsupervised approaches?

    • @DeepFindr 3 years ago

      Hi! I think it's possible. For approximating the model you need to have labels to compare against. If you are able to construct these labels for an unsupervised approach it works.
      Such a thing could be the probability of being assigned to a cluster. With that information I think lime can be used.
      For SHAP you can use Isolation Forest as unsupervised model being explained by the Tree Explainer.
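
A sketch of the clustering idea (the softmax over centroid distances is a hypothetical soft-assignment, not an official LIME recipe): construct a pseudo-probability of belonging to one cluster, then fit the local surrogate against it exactly as in the supervised case.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (100, 2)), rng.normal(5, 1, (100, 2))])

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

def cluster_proba(Z, cluster=0, temp=1.0):
    # Soft assignment: softmax over negative distances to the centroids
    d = np.linalg.norm(Z[:, None, :] - km.cluster_centers_[None], axis=2)
    p = np.exp(-d / temp)
    return (p / p.sum(axis=1, keepdims=True))[:, cluster]

# LIME-style local surrogate for the pseudo-label "probability of cluster 0"
x = X[0]
Z = x + 0.5 * rng.standard_normal((500, 2))
surrogate = Ridge().fit(Z, cluster_proba(Z))
print(surrogate.coef_.shape)  # (2,)
```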

  • @fahimhasan4905 2 years ago

    Can we use LIME for an RNN? Or do we need to use SHAP?

    • @DeepFindr 2 years ago

      As it is model-agnostic, it should in principle work with any machine learning model, so also with RNNs :)
      However, I have never tried it, and I also think it is difficult to work with a sequence of data inputs.
      Here is an example for a LSTM:
      towardsdatascience.com/interpreting-an-lstm-through-lime-e294e6ed3a03

  • @rajeshkalakoti2434 2 years ago

    Could you please explain what exactly a perturbation is?

    • @DeepFindr 2 years ago

      Hi!
      Perturbation means adding noise. For example, if age = 40, some perturbed instances can be age = 38 or age = 41. Does this help? :P
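
That answer in a few lines of NumPy (hypothetical numbers):

```python
import numpy as np

rng = np.random.default_rng(0)
age = 40.0
# Perturbation = the original value plus a little random noise
perturbed = age + rng.normal(0.0, 2.0, size=5)  # values near 38-42
print(perturbed.round(1))
```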

  • @RishabNakarmi-rn 1 month ago

    Didn't get it :( moving on to other videos