Opened 3 weeks ago

Last modified 9 days ago

#3102 reviewing defect

Enable passing dataset and class names in ClassificationProblemData constructor

Reported by: bburlacu
Owned by: mkommend
Priority: medium
Milestone: HeuristicLab 3.3.17
Component: Problems.DataAnalysis
Version: branch
Keywords:
Cc:

Description

Summary

  • Creating a copy/clone of a problem data object is a very common use case within our code base.
  • In the case of classification problems, one would like to preserve the class names and the positive class value while changing the dataset or its attributes (e.g. training/test partitions, shuffling).
  • However, this is not easily possible at the moment. The programmer has to take care of these properties manually after a new problem data object has been created, which leads to cumbersome and error-prone code.
  • This has already caused problems, as seen in #2806 and #3053: a new object is initialized and the programmer forgets, or is not aware of, all the properties that need to be kept consistent.

This ticket supersedes #2806 and #3053 and will track the required changes in ClassificationProblemData.cs and its dependents.
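
The failure mode described in the summary can be illustrated with a small, language-agnostic sketch. It is written in Python rather than C#, and every name in it (`ProblemData`, `from_original`, the default naming scheme `Class <value>`) is a hypothetical stand-in, not the actual HeuristicLab API or the signatures introduced in the changesets. It only shows why a constructor that takes the original problem data plus a new dataset avoids the manual bookkeeping.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

Row = Dict[str, float]


@dataclass
class ProblemData:
    """Hypothetical stand-in for a classification problem data object."""
    rows: List[Row]                                  # the dataset
    target: str                                      # name of the target column
    class_names: Optional[Dict[float, str]] = None   # class value -> display name
    positive_class: Optional[str] = None

    def __post_init__(self) -> None:
        # Assumed behavior: when no class names are passed in, generic
        # defaults are generated from the class values found in the data.
        if self.class_names is None:
            values = sorted({row[self.target] for row in self.rows})
            self.class_names = {v: f"Class {v:g}" for v in values}
        # Assumed behavior: default positive class is the largest class value.
        if self.positive_class is None and self.class_names:
            self.positive_class = self.class_names[max(self.class_names)]

    @classmethod
    def from_original(cls, original: "ProblemData", rows: List[Row]) -> "ProblemData":
        """Build problem data for a new dataset, carrying over the
        class-related properties of the original object."""
        return cls(rows, original.target,
                   dict(original.class_names), original.positive_class)


original = ProblemData(
    rows=[{"x": 0.1, "y": 0.0}, {"x": 0.9, "y": 1.0}, {"x": 0.4, "y": 0.0}],
    target="y",
    class_names={0.0: "benign", 1.0: "malignant"},
    positive_class="malignant",
)
shuffled_rows = list(reversed(original.rows))

# Naive re-creation: class names and positive class silently fall back to defaults.
naive = ProblemData(shuffled_rows, original.target)

# Copy-style construction: the class-related properties stay consistent.
copied = ProblemData.from_original(original, shuffled_rows)
```

In this sketch `naive` ends up with the generated names `Class 0` and `Class 1`, while `copied` keeps `benign`/`malignant` and the positive class `malignant`. Keeping the carry-over logic in one place means callers that shuffle or repartition a dataset do not need to know which properties must be restored.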

Change History (3)

comment:1 Changed 3 weeks ago by bburlacu

r17835: Add ClassificationProblemData constructor that explicitly takes class names and positive class value arguments, adapt code.

comment:2 Changed 9 days ago by bburlacu

  • Status changed from new to accepted

r17845: Add constructor taking original problem data and new dataset.

comment:3 Changed 9 days ago by bburlacu

  • Owner changed from bburlacu to mkommend
  • Status changed from accepted to reviewing