Random Forest Classification on Sentinel 2 Mosaic

Using only three bands does not make much sense to me, because the strength of the RF classifier is that it repeatedly draws random subsets of the input bands for its trees and so ends up relying on the most relevant ones. Why are you not using all 12 bands?
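To illustrate the point, here is a minimal sketch in Python with scikit-learn (my assumption, not necessarily the tool you are using). It builds a synthetic 12-band dataset in which only two hypothetical bands (indices 3 and 7) carry class information, trains a random forest that considers a random subset of bands at each split, and ranks the bands by importance. The data, band indices, and parameter values are made up for illustration and are not Sentinel-2 reflectances.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier


def rank_bands(n_samples=600, n_bands=12, seed=0):
    """Train a random forest on synthetic 'bands' and rank them by importance."""
    rng = np.random.default_rng(seed)

    # Synthetic pixel values: one column per band (illustrative only).
    X = rng.normal(size=(n_samples, n_bands))

    # Assumption for the demo: only bands 3 and 7 determine the class.
    y = (X[:, 3] + X[:, 7] > 0).astype(int)

    # max_features="sqrt" makes each split consider a random subset of bands.
    clf = RandomForestClassifier(
        n_estimators=200, max_features="sqrt", random_state=seed
    )
    clf.fit(X, y)

    importances = clf.feature_importances_
    ranking = np.argsort(importances)[::-1]  # most important band first
    return importances, ranking


importances, ranking = rank_bands()
for idx in ranking:
    print(f"band index {idx}: importance {importances[idx]:.3f}")
```

With all bands supplied, the forest itself picks out the informative ones; if you only give it three, it has nothing to choose from.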

Please have a look at the discussions here: Number of training samples at Random forest classifier