TY - GEN
T1 - Controlling attribute effect in linear regression
AU - Calders, Toon
AU - Karim, Asim A.
AU - Kamiran, Faisal
AU - Ali, Wasif Mohammad
AU - Zhang, Xiangliang
N1 - KAUST Repository Item: Exported on 2020-10-01
PY - 2013/12
Y1 - 2013/12
N2 - In data mining we often have to learn from biased data, because, for instance, data comes from different batches or there was a gender or racial bias in the collection of social data. In some applications it may be necessary to explicitly control this bias in the models we learn from the data. This paper is the first to study learning linear regression models under constraints that control the biasing effect of a given attribute such as gender or batch number. We show how propensity modeling can be used for factoring out the part of the bias that can be justified by externally provided explanatory attributes. Then we analytically derive linear models that minimize squared error while controlling the bias by imposing constraints on the mean outcome or residuals of the models. Experiments with discrimination-aware crime prediction and batch effect normalization tasks show that the proposed techniques are successful in controlling attribute effects in linear regression models. © 2013 IEEE.
AB - In data mining we often have to learn from biased data, because, for instance, data comes from different batches or there was a gender or racial bias in the collection of social data. In some applications it may be necessary to explicitly control this bias in the models we learn from the data. This paper is the first to study learning linear regression models under constraints that control the biasing effect of a given attribute such as gender or batch number. We show how propensity modeling can be used for factoring out the part of the bias that can be justified by externally provided explanatory attributes. Then we analytically derive linear models that minimize squared error while controlling the bias by imposing constraints on the mean outcome or residuals of the models. Experiments with discrimination-aware crime prediction and batch effect normalization tasks show that the proposed techniques are successful in controlling attribute effects in linear regression models. © 2013 IEEE.
UR - http://hdl.handle.net/10754/564826
UR - http://ieeexplore.ieee.org/document/6729491/
UR - http://www.scopus.com/inward/record.url?scp=84894660134&partnerID=8YFLogxK
U2 - 10.1109/ICDM.2013.114
DO - 10.1109/ICDM.2013.114
M3 - Conference contribution
SN - 9780769551081
SP - 71
EP - 80
BT - 2013 IEEE 13th International Conference on Data Mining
PB - Institute of Electrical and Electronics Engineers (IEEE)
ER -