I plotted the learning curves using micro and macro F-scores for a Multinomial Naive Bayes classifier.
The first plot is made using micro F-score, and the second using macro F-score. I find it quite difficult to interpret both of them. The explanations I found for learning curves use errors.
I think in the first plot there is variability up to around 300 instances, and then it starts to converge until the point when the lines are parallel. Therefore, even adding more data would not help. But what about bias?
Also can you please explain to me the difference between the plots in terms of micro and macro F-scores?