TY - JOUR
T1 - Regularization techniques for learning with matrices
AU - Kakade, Sham M.
AU - Shalev-Shwartz, Shai
AU - Tewari, Ambuj
PY - 2012/6
Y1 - 2012/6
N2 - There is growing body of learning problems for which it is natural to organize the parameters into a matrix. As a result, it becomes easy to impose sophisticated prior knowledge by appropriately regularizing the parameters under some matrix norm. This work describes and analyzes a systematic method for constructing such matrix-based regularization techniques. In particular, we focus on how the underlying statistical properties of a given problem can help us decide which regularization function is appropriate. Our methodology is based on a known duality phenomenon: a function is strongly convex with respect to some norm if and only if its conjugate function is strongly smooth with respect to the dual norm. This result has already been found to be a key component in deriving and analyzing several learning algorithms. We demonstrate the potential of this framework by deriving novel generalization and regret bounds for multi-task learning, multi-class learning, and multiple kernel learning.
AB - There is growing body of learning problems for which it is natural to organize the parameters into a matrix. As a result, it becomes easy to impose sophisticated prior knowledge by appropriately regularizing the parameters under some matrix norm. This work describes and analyzes a systematic method for constructing such matrix-based regularization techniques. In particular, we focus on how the underlying statistical properties of a given problem can help us decide which regularization function is appropriate. Our methodology is based on a known duality phenomenon: a function is strongly convex with respect to some norm if and only if its conjugate function is strongly smooth with respect to the dual norm. This result has already been found to be a key component in deriving and analyzing several learning algorithms. We demonstrate the potential of this framework by deriving novel generalization and regret bounds for multi-task learning, multi-class learning, and multiple kernel learning.
KW - Generalization bounds
KW - Multi-class learning
KW - Multi-task learning
KW - Multiple kernel learning
KW - Regret bounds
KW - Regularization
KW - Strong convexity
UR - http://www.scopus.com/inward/record.url?scp=84859411801&partnerID=8YFLogxK
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:84859411801
SN - 1532-4435
VL - 13
SP - 1865
EP - 1890
JO - Journal of Machine Learning Research
JF - Journal of Machine Learning Research
ER -