Effect of over-sampling versus under-sampling for SVM and LDA classifiers for activity recognition
Price
Free (open access)
Volume
Volume 11 (2016), Issue 3
Pages
10
Page Range
306 - 316
Paper DOI
10.2495/DNE-V11-N3-306-316
Copyright
WIT Press
Author(s)
M.B ABIDINE, B. FERGANI & F.J ORDÓÑEZ
Abstract
Accurately recognizing the rare activities from sensor network-based smart homes for monitoring the elderly person is a challenging task. Activity recognition datasets are generally imbalanced, meaning certain activities occur more frequently than others. Not incorporating this class imbalance results in an evaluation that may lead to disastrous consequences for elderly persons. To overcome this problem, we evaluate two resam- pling methods using Over-sampling (OS) and Under-sampling (US). Then, these methods were combined with the discriminative classifiers named support vector machines (SVM) and linear discriminant analysis (LDA). experimental results carried out on multiple real-world smart home datasets demonstrate the feasibility of the proposal. Besides, a comparison with some state–of-the-art techniques based on Conditional Random Field (CRF) and Hidden Markov Model (HMM), we demonstrate that the US-SVM and OS-LDA are able to surpass HMM, CRF, SVM, LDA, OS-SVM and US-LDA. However, OS-LDA is the most effective method in terms of recognition of activities.
Keywords
humanactivity recognition, imbalanced data, LDA, machine learning, SVM