At the moment, I can use a neural network (NN) to detect objects such as humans in a frame from the camera. Once a person is located, I can feed the cropped image of that person to another NN that is designed to classify them as male or female.
Let's say I get 1 frame per second from the camera and run detection on each frame. The objective is to count the number of males and females who walk past the camera over a given number of hours.
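To make the setup concrete, here is a minimal sketch of the per-frame counting loop I have in mind. `detect_people` and `classify_gender` are placeholder names standing in for my two networks, not real library calls, and the toy frames at the bottom are made up purely for illustration.

```python
from collections import Counter

def count_naively(frames, detect_people, classify_gender):
    """Add one to the tally for every detection in every frame."""
    counts = Counter()
    for frame in frames:
        # detect_people (hypothetical) returns one cropped image per person found.
        for crop in detect_people(frame):
            # classify_gender (hypothetical) returns "male" or "female".
            counts[classify_gender(crop)] += 1
    return counts

# Toy stand-ins: each "frame" is just a list of already-labelled people.
# Imagine the same man is visible in all three frames.
frames = [["male"], ["male"], ["male", "female"]]
counts = count_naively(
    frames,
    detect_people=lambda frame: frame,
    classify_gender=lambda crop: crop,
)
print(counts)  # the tally is per detection, not per person
```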
My problem is that the same person appears in multiple frames and will be counted more than once. I can't wrap my head around how to train a NN to understand that this is the same person without diving into facial recognition. I'm sure there is some tracking technique that I just don't know about.
One small constraint: if a person leaves the camera frame and comes back into it later, it is fine to treat them as two people.
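To show the behaviour I'm after, here is a rough sketch that matches each detection to a bounding box from the previous frame by overlap (intersection over union, IoU) and only counts detections that match nothing. This is just my guess at the general idea, not a claim that it is the right technique. The `(x1, y1, x2, y2)` box format, the `iou_threshold` value, and the demo data are all assumptions, and at 1 frame per second people may move too far between frames for overlap matching to work.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def count_unique(frames, iou_threshold=0.3):
    """frames: one list per frame of (box, label) pairs. Returns {label: count}."""
    counts = {}
    previous = []  # boxes detected in the previous frame only
    for detections in frames:
        unmatched = list(previous)
        for box, label in detections:
            # Greedily pick the previous box that overlaps this one the most.
            best = max(unmatched, key=lambda p: iou(p, box), default=None)
            if best is not None and iou(best, box) >= iou_threshold:
                unmatched.remove(best)  # treated as the same person as last frame
            else:
                counts[label] = counts.get(label, 0) + 1  # treated as a new person
        # Anyone not seen in this frame is forgotten, so a person who
        # leaves and comes back later is counted again.
        previous = [box for box, _ in detections]
    return counts

# Made-up demo: a man stays for two frames, leaves, then returns with a woman.
demo = [
    [((0, 0, 10, 20), "male")],
    [((2, 0, 12, 20), "male")],
    [],
    [((2, 0, 12, 20), "male"), ((50, 0, 60, 20), "female")],
]
result = count_unique(demo)
print(result)
```

In this demo the man is counted once for his first two frames and once more when he re-enters, which matches the constraint above.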
Any help or direction would be appreciated!
Thank you all in advance!