Of all the possible causes of X, which ones are most important?
One way to answer this: estimate the arrow strength of each cause (see dowhy link).
Shuffling method
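For each feature $i$, shuffle its values so the model can no longer use it, and measure the resulting error $\epsilon_{\text{minus } i}$. Then compute the importance $\Delta$: the extra error caused by removing the feature, relative to the largest possible extra error. Here $\epsilon_{\text{complete}}$ is the error with all features intact and $\epsilon_{\text{baseline}}$ is the error with all features removed.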
$\Delta = \frac{\epsilon_{\text{minus } i} - \epsilon_{\text{complete}}}{\epsilon_{\text{baseline}} - \epsilon_{\text{complete}}}$
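The maximum importance is $\Delta = 1.0$, reached when removing the feature increases the error as much as removing all the features. At the other end, $\Delta = 0$ means removing the feature does not change the error at all.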
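A quick check of the formula with made-up error values (purely illustrative, not from any experiment): suppose the error is 0.10 with all features, 0.50 with all features removed, and 0.30 with only feature $i$ removed. Then

$$\Delta = \frac{0.30 - 0.10}{0.50 - 0.10} = \frac{0.20}{0.40} = 0.5$$

so removing feature $i$ accounts for half of the largest possible increase in error.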
Train the model first, then use it to compute the importance of each feature. Interpret the result as the arrow strength according to this particular model; a different model may give different strengths.
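The procedure can be sketched in a few lines of NumPy. This is a minimal illustration under several assumptions that are not in the notes: the error is mean squared error, "removing" a feature means shuffling its column, the baseline shuffles every column independently, and the errors are averaged over a few repeated shuffles. The names `shuffling_importance`, `shuffle_columns`, and `n_repeats` are hypothetical, and the toy data and linear model exist only to make the example runnable.

```python
import numpy as np


def mse(y_true, y_pred):
    """Mean squared error (assumed error measure for this sketch)."""
    return float(np.mean((y_true - y_pred) ** 2))


def shuffle_columns(X, cols, rng):
    """Return a copy of X with each listed column shuffled independently."""
    X_shuffled = X.copy()
    for c in cols:
        X_shuffled[:, c] = rng.permutation(X_shuffled[:, c])
    return X_shuffled


def shuffling_importance(predict, X, y, n_repeats=20, seed=0):
    """Compute Delta for every feature of an already trained model.

    Delta_i = (eps_minus_i - eps_complete) / (eps_baseline - eps_complete)
    """
    rng = np.random.default_rng(seed)
    n_features = X.shape[1]

    # Error with all features intact.
    eps_complete = mse(y, predict(X))

    # Error with all features removed (every column shuffled).
    eps_baseline = np.mean([
        mse(y, predict(shuffle_columns(X, range(n_features), rng)))
        for _ in range(n_repeats)
    ])

    deltas = np.empty(n_features)
    for i in range(n_features):
        # Error with only feature i removed.
        eps_minus_i = np.mean([
            mse(y, predict(shuffle_columns(X, [i], rng)))
            for _ in range(n_repeats)
        ])
        deltas[i] = (eps_minus_i - eps_complete) / (eps_baseline - eps_complete)
    return deltas


# Toy data: feature 0 matters a lot, feature 1 a little, feature 2 not at all.
data_rng = np.random.default_rng(42)
X = data_rng.normal(size=(2000, 3))
y = 3.0 * X[:, 0] + 1.0 * X[:, 1] + data_rng.normal(scale=0.1, size=2000)

# "Train the model": ordinary least squares with an intercept column.
A = np.column_stack([X, np.ones(len(X))])
coef, *_ = np.linalg.lstsq(A, y, rcond=None)


def predict(X_new):
    return np.column_stack([X_new, np.ones(len(X_new))]) @ coef


deltas = shuffling_importance(predict, X, y)
print(deltas)
```

On this toy data the first feature should get by far the largest $\Delta$ and the third a value near zero. Because the scores come from one fitted model, they describe the arrow strength as that model sees it, not the strength of the underlying causal links themselves.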