On the homogenization of data from two laboratories using Genetic Programming

Jose G. Moreno-Torres, Xavier Llorà, David E. Goldberg, Rohit Bhargava

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In experimental sciences, diversity tends to difficult predictive models' proper generalization across data provided by different laboratories. Thus, training on a data set produced by one lab and testing on data provided by another lab usually results in low classification accuracy. Despite the fact that the same protocols were followed, variability on measurements can introduce unforeseen variations that affect the quality of the model. This paper proposes a Genetic Programming based approach, where a transformation of the data from the second lab is evolved driven by classifier performance. A real-world problem, prostate cancer diagnosis, is presented as an example where the proposed approach was capable of repairing the fracture between the data of two different laboratories.

Original languageEnglish (US)
Title of host publicationLearning Classifier Systems - 11th International Workshop, IWLCS 2008 and 12th International Workshop, IWLCS 2009, Revised Selected Papers
Pages185-197
Number of pages13
DOIs
StatePublished - Dec 1 2010
Event11th International Workshop on Learning Classifier Systems, IWLCS 2008 and 12th International Workshop on Learning Classifier Systems, IWLCS 2009 - Montreal, QC, Canada
Duration: Jul 9 2009Jul 9 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6471 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other11th International Workshop on Learning Classifier Systems, IWLCS 2008 and 12th International Workshop on Learning Classifier Systems, IWLCS 2009
CountryCanada
CityMontreal, QC
Period7/9/097/9/09

Fingerprint

Genetic programming
Genetic Programming
Homogenization
Classifiers
Network protocols
Testing
Prostate Cancer
Predictive Model
Classifier
Tend

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Moreno-Torres, J. G., Llorà, X., Goldberg, D. E., & Bhargava, R. (2010). On the homogenization of data from two laboratories using Genetic Programming. In Learning Classifier Systems - 11th International Workshop, IWLCS 2008 and 12th International Workshop, IWLCS 2009, Revised Selected Papers (pp. 185-197). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6471 LNAI). https://doi.org/10.1007/978-3-642-17508-4_12

On the homogenization of data from two laboratories using Genetic Programming. / Moreno-Torres, Jose G.; Llorà, Xavier; Goldberg, David E.; Bhargava, Rohit.

Learning Classifier Systems - 11th International Workshop, IWLCS 2008 and 12th International Workshop, IWLCS 2009, Revised Selected Papers. 2010. p. 185-197 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6471 LNAI).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Moreno-Torres, JG, Llorà, X, Goldberg, DE & Bhargava, R 2010, On the homogenization of data from two laboratories using Genetic Programming. in Learning Classifier Systems - 11th International Workshop, IWLCS 2008 and 12th International Workshop, IWLCS 2009, Revised Selected Papers. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6471 LNAI, pp. 185-197, 11th International Workshop on Learning Classifier Systems, IWLCS 2008 and 12th International Workshop on Learning Classifier Systems, IWLCS 2009, Montreal, QC, Canada, 7/9/09. https://doi.org/10.1007/978-3-642-17508-4_12
Moreno-Torres JG, Llorà X, Goldberg DE, Bhargava R. On the homogenization of data from two laboratories using Genetic Programming. In Learning Classifier Systems - 11th International Workshop, IWLCS 2008 and 12th International Workshop, IWLCS 2009, Revised Selected Papers. 2010. p. 185-197. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-17508-4_12
Moreno-Torres, Jose G. ; Llorà, Xavier ; Goldberg, David E. ; Bhargava, Rohit. / On the homogenization of data from two laboratories using Genetic Programming. Learning Classifier Systems - 11th International Workshop, IWLCS 2008 and 12th International Workshop, IWLCS 2009, Revised Selected Papers. 2010. pp. 185-197 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{f6de12be40804fadbebf0cad3a658e1d,
title = "On the homogenization of data from two laboratories using Genetic Programming",
abstract = "In experimental sciences, diversity tends to difficult predictive models' proper generalization across data provided by different laboratories. Thus, training on a data set produced by one lab and testing on data provided by another lab usually results in low classification accuracy. Despite the fact that the same protocols were followed, variability on measurements can introduce unforeseen variations that affect the quality of the model. This paper proposes a Genetic Programming based approach, where a transformation of the data from the second lab is evolved driven by classifier performance. A real-world problem, prostate cancer diagnosis, is presented as an example where the proposed approach was capable of repairing the fracture between the data of two different laboratories.",
author = "Moreno-Torres, {Jose G.} and Xavier Llor{\`a} and Goldberg, {David E.} and Rohit Bhargava",
year = "2010",
month = "12",
day = "1",
doi = "10.1007/978-3-642-17508-4_12",
language = "English (US)",
isbn = "3642175074",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "185--197",
booktitle = "Learning Classifier Systems - 11th International Workshop, IWLCS 2008 and 12th International Workshop, IWLCS 2009, Revised Selected Papers",

}

TY - GEN

T1 - On the homogenization of data from two laboratories using Genetic Programming

AU - Moreno-Torres, Jose G.

AU - Llorà, Xavier

AU - Goldberg, David E.

AU - Bhargava, Rohit

PY - 2010/12/1

Y1 - 2010/12/1

N2 - In experimental sciences, diversity tends to difficult predictive models' proper generalization across data provided by different laboratories. Thus, training on a data set produced by one lab and testing on data provided by another lab usually results in low classification accuracy. Despite the fact that the same protocols were followed, variability on measurements can introduce unforeseen variations that affect the quality of the model. This paper proposes a Genetic Programming based approach, where a transformation of the data from the second lab is evolved driven by classifier performance. A real-world problem, prostate cancer diagnosis, is presented as an example where the proposed approach was capable of repairing the fracture between the data of two different laboratories.

AB - In experimental sciences, diversity tends to difficult predictive models' proper generalization across data provided by different laboratories. Thus, training on a data set produced by one lab and testing on data provided by another lab usually results in low classification accuracy. Despite the fact that the same protocols were followed, variability on measurements can introduce unforeseen variations that affect the quality of the model. This paper proposes a Genetic Programming based approach, where a transformation of the data from the second lab is evolved driven by classifier performance. A real-world problem, prostate cancer diagnosis, is presented as an example where the proposed approach was capable of repairing the fracture between the data of two different laboratories.

UR - http://www.scopus.com/inward/record.url?scp=79956294553&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79956294553&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-17508-4_12

DO - 10.1007/978-3-642-17508-4_12

M3 - Conference contribution

AN - SCOPUS:79956294553

SN - 3642175074

SN - 9783642175077

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 185

EP - 197

BT - Learning Classifier Systems - 11th International Workshop, IWLCS 2008 and 12th International Workshop, IWLCS 2009, Revised Selected Papers

ER -