POTD: Wikipedia (Extracting text descriptions)

I am trying to use the Wikipedia Picture of the Day (POTD) for a pet project. I need to extract each image, the description given for that image in the POTD, and the entity mentions in the description.

Could someone point me to a way to extract the description for each image, or to a parameter I can add to the request to get it?

Thank you very much.

You are more likely to get help if you ask specific questions (explain what you have tried and where you are currently stuck).
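
As a possible starting point, here is a minimal sketch that asks the MediaWiki API for a file's metadata using prop=imageinfo with iiprop=url|extmetadata, then reads the ImageDescription field. It assumes the POTD file is hosted on Wikimedia Commons and that you already know its file title; the helper names, the User-Agent string and the example title are placeholders of mine, not anything from the question. Note that this returns the description from the file's own page, which may differ from the caption shown alongside the POTD.

```python
import html
import json
import re
import urllib.parse
import urllib.request

# Assumption: the image lives on Wikimedia Commons.
API_URL = "https://commons.wikimedia.org/w/api.php"


def build_params(file_title):
    """Query parameters asking for the file URL and its extended metadata."""
    return {
        "action": "query",
        "format": "json",
        "titles": file_title,
        "prop": "imageinfo",
        "iiprop": "url|extmetadata",
    }


def extract_descriptions(response):
    """Map each page title to its image URL and plain-text description.

    Expects the default JSON layout, where query.pages is a dict keyed
    by page id. ImageDescription usually contains HTML, so tags are
    stripped and character entities are decoded.
    """
    results = {}
    pages = response.get("query", {}).get("pages", {})
    for page in pages.values():
        for info in page.get("imageinfo", []):
            meta = info.get("extmetadata", {})
            raw = meta.get("ImageDescription", {}).get("value", "")
            text = html.unescape(re.sub(r"<[^>]+>", "", raw)).strip()
            results[page.get("title", "")] = {
                "url": info.get("url"),
                "description": text,
            }
    return results


def fetch_descriptions(file_title):
    """Hypothetical helper: perform the request (needs network access)."""
    url = API_URL + "?" + urllib.parse.urlencode(build_params(file_title))
    # Placeholder User-Agent; replace it with one identifying your project.
    request = urllib.request.Request(
        url, headers={"User-Agent": "potd-pet-project/0.1 (example)"}
    )
    with urllib.request.urlopen(request) as reply:
        return extract_descriptions(json.load(reply))
```

Calling fetch_descriptions("File:Example.jpg") should give a dict keyed by file title. The parsing is kept separate from the network call so it can be tried on a saved response first.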