I just had a short chat with Jeff Bier, the founder of the Embedded Vision Alliance, and asked him how he rates the chances of intelligent image recognition in healthcare. He says it is possible to achieve success rates similar to those of, for example, "Inception-v3" (arXiv), given a data set that is big enough, combined with a good neural network architecture.
https://www.embedded-vision.com/jeff-bier-biography