Opened 6 years ago
Closed 6 years ago
#2923 closed enhancement (done)
Penn ML Benchmark Problem Instances
Reported by: | bburlacu | Owned by: | gkronber |
---|---|---|---|
Priority: | medium | Milestone: | HeuristicLab 3.3.16 |
Component: | Problems.Instances | Version: | trunk |
Keywords: | Cc: |
Description (last modified by bburlacu)
The EpistasisLab at the University of Pennsylvania maintains a curated set of benchmark datasets for classification and regression [1].
A large part of the regression datasets have already been used for an in-depth comparison of symbolic regression methods [2].
This ticket will track integration of these benchmark problems in HeuristicLab via a new problem instance provider.
Attachments (3)
Change History (11)
comment:1 Changed 6 years ago by bburlacu
- Description modified (diff)
- Status changed from new to accepted
comment:2 Changed 6 years ago by bburlacu
- Owner changed from bburlacu to gkronber
- Status changed from accepted to reviewing
comment:3 Changed 6 years ago by bburlacu
r15952: Fix namespace.
comment:4 Changed 6 years ago by gkronber
I made a quick test using our GBT implementation with standard settings and 3000 iterations.
See the two attachments.
Changed 6 years ago by gkronber
Changed 6 years ago by gkronber
comment:5 Changed 6 years ago by gkronber
- Status changed from reviewing to readytorelease
comment:6 Changed 6 years ago by gkronber
comment:7 Changed 6 years ago by gkronber
comment:8 Changed 6 years ago by gkronber
- Resolution set to done
- Status changed from readytorelease to closed
Note: See
TracTickets for help on using
tickets.
r15951: Add PennML problems and implement instance provider. A reusable descriptor was also implemented, taking advantage of the structure of the data (by convention, target is named "target" and is always the last column). The instance provider generates the descriptors on the fly from the archive.