PHM Data Challenge
Call for Participation
Final Results
(Updated 07-Aug-2014)
Rank | Team | Score |
---|---|---|
1 | diligent | 1.15840 |
2 | JohnCorner | 1.14340 |
3 | Master | 1.09810 |
4 | Better Than Guess | 1.07430 |
5 | july827714 | 1.06850 |
6 | BUAA-RMS | 1.06600 |
7 | young | 1.06200 |
8 | dark | 1.05460 |
9 | Solve for [x] | 1.05110 |
10 | Direwolves | 1.05020 |
11 | bayes’ vigilantes | 1.03660 |
12 | nicta | 1.03250 |
13 | R707 | 1.03020 |
14 | dream | 1.02830 |
15 | NRCCanada | 1.02490 |
16 | burrito | 1.01920 |
17 | Phoenix | 1.01730 |
18 | DSUC | 1.00870 |
19 | Ocelot | 0.99890 |
20 | miccha-ditthi | 0.99360 |
Teams
Collaboration is encouraged and teams may be comprised of one or more students and/or professionals. The team judged to have the first and second best scores will be awarded prizes of $600 and $400 respectively contingent upon:
- Having at least one member of the team attend the PHM 2014 Conference
- Presenting the analysis results and technique employed at a special session within the Conference program
- Submitting a peer-reviewed Conference paper. (Submission of the challenge special session papers is outside the regular paper submission process and follows its own modified schedule.)
- The top entries will also be encouraged to submit a journal-quality paper to the International Journal of Prognostics and Health Management (ijPHM).
The organizers of the competition reserve the right to both modify these rules and disqualify any team for any practices it deems inconsistent with fair and open practices.
Registration
Teams may register by contacting the Competition organizers (dustin.garvey@gmail.com, robert.wigny@pwc.ca) with their name(s) and a team alias under which the scores would be posted.
Please note: In the spirit of fair competition, we allow only one account per team. Please do not register multiple times under different user names, under fictitious names, or using anonymous accounts. Competition organizers reserve the right to delete multiple entries from the same person (or team) and/or to disqualify those who are trying to “game” the system or using fictitious identities.
Key Dates
Key Conference Dates | |
---|---|
Competition Closed | 5 Aug 2014 |
Preliminary Winners Announced | 6 Aug 2014 |
Winning Papers Due | 14 Sep 2014 |
Winners Announced | 21 Sep 2014 |
PHM Conference Dates | 29 Sep – 2 Oct 2014 |
Data
There are 5 data sets. Due to proprietary concerns we cannot provide a detailed description of the data and the domain. If you have any questions, please contact Dustin Garvey (dustin.garvey@gmail.com).
Train – Part Consumption.csv
This file contains a record of what parts were replaced on an asset and the reason for their replacement. This file contains five columns. The first column is the asset ID. The second column is the time that the part(s) were replaced. The third column is a code that specifies the reason the asset was being worked on. The fourth column is a code that specifies the type of part that was replaced. The fifth column is the number of parts of the specified type that were replaced.
Train – Usage.csv
This file contains a record of a parameter that roughly measures asset usage. The units of this measure cannot be shared due to proprietary concerns, but think something along the lines of an odometer. This file contains three columns. The first column is the asset ID. The second column is the time the measurement was taken. The third column is the current usage of the asset.
Train – Failures.csv
This file contains a set of asset failures. The file contains two columns. The first is an asset ID and the second is the time of the failure. The units of time cannot be shared.
Test – Part Consumption.csv
This file contains the same type of information as the training file but over the test period.
Test – Usage.csv
This file contains the same type of information as the training file but over the test period.
Test Instances.csv
This file contains the asset ID and time combinations that should be used to generate the health scores in the submission. This file contains two columns. The first column is the asset ID. The second column is the time relative to which the health score should be calculated.
Example Submission.csv
This file contains an example submission with specified a segmentation of health scores of 0.9 (lower than 0.9 = low risk, greater than or equal to 0.9 = high risk) and randomly generated scores for the test instances.
Submission & Scoring
Each team is permitted one submission a week. A submission will be composed of a CSV file whose name is the team alias. If the alias is “example”, then the filename will be “example.csv”. The first line of the CSV file should be the value that segments the health scores into low and high risk. More specifically assets are low risk will have scores below the supplied limit and high risk will have scores above the limit. The remaining lines should have three columns for the prescribed set of assets and times. The first and second column should be prescribed asset ID and calculation times. The third column should be a health score, with 0 meaning a healthy asset and increasingly large values meaning increasingly degraded.
The test assets and times contain a balanced sample of instances where the asset did and did not fail in the immediate future (within 3 time units). If N is the number of samples of each type (i.e. with and without a failure in the immediate future, total samples is 2N), L is the number without an immediate failure and placed in the low risk category, and H is the number with an immediate failure and placed in the high risk category, the score of the submission is: