Sangam: A Confluence of Knowledge Streams

Human Pose and Action Recognition using Negative Space Analysis

Show simple item record

dc.contributor Electrical Engineering
dc.creator Janse Van Vuuren, Michaella
dc.date 2016-06-27T19:03:47Z
dc.date 2016-06-27T19:03:47Z
dc.date 2004-12
dc.date.accessioned 2023-03-03T18:51:30Z
dc.date.available 2023-03-03T18:51:30Z
dc.identifier eprint:296
dc.identifier http://hdl.handle.net/10919/71571
dc.identifier.uri http://localhost:8080/xmlui/handle/CUHPOERS/282033
dc.description This thesis proposes a novel approach to extracting pose information from image sequences. Current state of the art techniques focus exclusively on the image space occupied by the body for pose and action recognition. The method proposed here, however, focuses on the negative spaces: the areas surrounding the individual. This has resulted in the colour-coded negative space approach, an image preprocessing step that circumvents the need for complicated model fitting or template matching methods. The approach can be described as follows: negative spaces surrounding the human silhouette are extracted using horizontal and vertical scanning processes. These negative space areas are more numerous, and undergo more radical changes in shape than the single area occupied by the figure of the person performing an action. The colour-coded negative space representation is formed using the four binary images produced by the scanning processes. Features are then extracted from the colour-coded images. These are based on the percentage of area occupied by distinct coloured regions as well as the bounding box proportions. Pose clusters are identified using feedback from an independent action set. Subsequent images are classified using a simple Euclidean distance measure. An image sequence is thus temporally segmented into its corresponding pose representations. Action recognition simply becomes the detection of a temporally ordered sequence of poses that characterises the action. The method is purely vision-based, utilising monocular images with no need for body markers or special clothing. Two datasets were constructed using several actors performing different poses and actions. Some of these actions included actors waving their arms, sitting down or kicking a leg. These actions were recorded against a monochrome background to simplify the segmentation of the actors from the background. The actions were then recorded on DV cam and digitised into a data base. The silhouette images from these actions were isolated and placed in a frame or bounding box. The next step was to highlight the negative spaces using a directional scanning method. This scanning method colour-codes the negative spaces of each action. What became immediately apparent is that very distinctive colour patterns formed for different actions. To emphasise the action, different colours were allocated to negative spaces surrounding the image. For example, the space between the legs of an actor standing in a T - pose with legs apart would be allocated yellow, while the space below the arms were allocated different shades of green. The space surrounding the head would be different shades of purple. During an action when the actor moves one leg up in a kicking fashion, the yellow colour would increase. Inversely, when the actor closes his legs and puts them together, the yellow colour filling the negative space would decrease substantially. What also became apparent is that these coloured negative spaces are interdependent and that they influence each other during the course of an action. For example, when an actor lifts one of his legs, increasing the yellow-coded negative space, the green space between that leg and the arm decreases. This interrelationship between colours hold true for all poses and actions as presented in this thesis. In terms of pose recognition, it is significant that these colour coded negative spaces and the way the change during an action or a movement are substantial and instantly recognisable. Compare for example, looking at someone lifting an arm as opposed to seeing a vast negative space changing shape. In a controlled research environment, several actors were instructed to perform a number of different actions. After colour coding the negative spaces, it became apparent that every action can be recognised by a unique colour coded pattern. The challenge is to ascribe a numerical presentation, a mathematical quotation, to extract the essence of what is so visually apparent. The essence of pose recognition and it's measurability lies in the relationship between the colours in these negative spaces and how they impact on each other during a pose or an action. The simplest way of measuring this relationship is by calculating the percentage of each colour present during an action. These calculated percentages become the basis of pose and action recognition. By plotting these percentages on a graph confirms that the essence of these different actions and poses can in fact been captured and recognised. Despite variations in these traces caused by time differences, personal appearance and mannerisms, what emerged is a clear recognisable pattern that can be married to an action or different parts of an action. 7 Actors might lift their left leg, some slightly higher than others, some slower than others and these variations in terms of colour percentages would be recorded as a trace, but there would be very specific stages during the action where the traces would correspond, making the action recognisable.In conclusion, using negative space as a tool in human pose and tracking recognition presents an exiting research avenue because it is influenced less by variations such as difference in personal appearance and changes in the angle of observation. This approach is also simplistic and does not rely on complicated models and templates
dc.format application/pdf
dc.format application/pdf
dc.language en
dc.publisher University of Cape Town
dc.rights In Copyright
dc.rights http://rightsstatements.org/vocab/InC/1.0/
dc.subject Image processing
dc.subject pattern recognition
dc.subject computer vision
dc.subject human pose and action recognition using negative space analysis
dc.subject surveillance
dc.subject chroma-key
dc.subject silhouette
dc.subject preprocessing
dc.subject color coded
dc.subject colour-coded
dc.subject horizontal scanning
dc.subject vertical scanning
dc.subject RGB colour image
dc.subject bounding box
dc.subject multi-view
dc.subject multi view
dc.subject aerobics
dc.subject exercise
dc.subject feature extraction
dc.subject clustering
dc.subject over generalisation
dc.subject over fitting
dc.subject k-means
dc.subject SOM
dc.subject self organising map
dc.subject automatic partitioning
dc.subject pose labelling
dc.subject pose clusters
dc.subject pose classification
dc.subject correlation
dc.subject sequence
dc.subject feedback
dc.subject recognising actions
dc.subject visualization
dc.subject feature plots
dc.subject direct able
dc.subject directable character
dc.subject database
dc.subject animated sequences .
dc.subject QA75
dc.subject NX
dc.subject QA76
dc.subject TA
dc.subject T1
dc.subject TK
dc.subject Q1
dc.title Human Pose and Action Recognition using Negative Space Analysis
dc.type Dissertation


Files in this item

Files Size Format View
296_1.pdf 10.52Mb application/pdf View/Open

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse