20,000 Image caption data of gestures, mainly for young and middle-aged people, the collection environment includes indoor scenes and outdoor scenes, including various collection environments, various seasons, and various collection angles. The description language is English, mainly describing hand characteristics such as hand movements, gestures, image acquisition angles, gender, age, etc.
For more details, please refer to the link: https://www.nexdata.ai/datasets/llm/1287?source=Github
10,000 images
Asian
male and female
mainly young and middle-aged
including indoor and outdoor scenes
multiple age groups, multiple collection environments, multiple seasons, multiple camera angles
the image data format is .jpg, the text format is .txt
English, Chinese
in principle, it is 30~60 words, and usually contains 3-5 sentences
hand movement, gesture posture, image acquisition angle, character gender, age
the proportion of correctly labeled images is not less than 97%
Commercial License