Challenge 2 projects

From Epidemium
Jump to: navigation, search
Challenge2 900px.png
Challenge 2: Foresee cancers

Prediction challenge (the incidence of cancers)
Developing a predictive tool for the progression of cancer in time and space,depending on the known or supposed factors that determine its evolution.
The challenge consists in developing of a predictive tool for the incidence of cancers.The data used to learn the models are detailed below.
The Training set will be composed of the data covering the period from 1950 to 2003.The validation set will be composed of the data covering the period from 2003 to 2007.Note that the data from 2007 to 2012 in the process of being obtained and will be Integrated into the validation set.
This challenge is divided into two distinct parts: 1/ prediction in the world, 2/ prediction by country.

The winning team will be the one whose prediction will be the most efficient on the validation set. A leaderboard will be set up with real-time updating.


The registered projects:

Opencancer

Time-series analysis and forecast models on cancer incidence
Aim: develop a prediction model of colon and ovary cancer.

Associated skills: Data cleaning, Database Management Systems, Relationnal Database Management Systems, Mathematics, Statistic, Rules of Professional Responsibility and Ethical Obligation.

Aim: provide a study on time-series analysis and forecast models on cancer incidence (Machine Learning approaches).

Associated skills: Python, Java, C, C++, R, SQL, NOSQL, Machine learning, Deep learning, Elastic Search, Data preparation, Data cleaning, Data visualization, Mathematics.


Cancer and local predictions Baseline 2nd Edition
Aim: optimize the understanding and prediction of stand-alone cancers under certain circumstances (geographical e.g.).

Associated skills: Distributed Databases, Deep learning, Machine learning, Data visualization, Mathematics, Statistic, Biology, Oncology, Virology, Geriatrics, Treatment Services, Hierarchical Database Systems, Data cleaning.

Aim: create an online platform hosting an interactive, robust, ready-to-use dataset. This platform should ideally be linked with a data science / machine learning environment to easily design and test statistical models.

Associated skills: Data cleaning, Data preparation, Machine learning, Database Management Systems, Mathematics, Statistic, UI Design, Web design, Graphics, Oncology, SQL.