Opened 10 years ago
Closed 9 years ago
#2335 closed enhancement (done)
Data Preprocessing Improvements
Reported by: | ehopf | Owned by: | mkommend |
---|---|---|---|
Priority: | medium | Milestone: | HeuristicLab 3.3.12 |
Component: | DataPreprocessing | Version: | 3.3.12 |
Keywords: | Cc: |
Description (last modified by ehopf)
Various data preprocessing improvements, especially for data sets with many missing values. This ticket also includes fixes for defects found in the data preprocessing components.
Defects
(1) Problem data generation fails with an exception if the user deletes one or more columns.
(2) The generated classification problem data doesn't keep the user-defined positive class.
(3) Editing a cell in the DataGrid view leads to an exception.
(4) An exception occurs if the user switches to the Histogram view after deleting a column (only if the user had already opened the Histogram view before the deletion).
(5) Manipulation view: wrong column count in the preview of "Delete Columns with insufficient Information" and "Delete Columns with insufficient Variance".
(6) The Statistics view doesn't update the statistics if a filter is applied to the data.
(7) Deactivated Export Problem button.
(8) Unexpected row deletion behavior if the data is sorted (DataGrid view).
(9) An exception occurs if the user deletes more than half of the rows in the DataGrid view.
Features
(1) An option to display missing values within the Histogram view.
(2) A possibility to delete columns in the DataGrid view.
(3) Display missing value information in the statistics available in the DataGrid view.
Change History (42)
comment:1 Changed 10 years ago by ehopf
- Version changed from 3.3.11 to branch
comment:2 Changed 10 years ago by ehopf
- Status changed from new to accepted
comment:3 Changed 10 years ago by ehopf
comment:4 Changed 10 years ago by ehopf
r12052: created the HL.DataPreprocessing project subfolder in the DataPreprocessing branch
comment:5 Changed 10 years ago by ehopf
r12053: moved the HL.DataPreprocessing project to the appropriate subfolder
comment:6 Changed 10 years ago by ehopf
r12054: Branched HeuristicLab.DataPreprocessing.Views to implement the improvements
comment:7 Changed 10 years ago by ehopf
r12056: Created a solution file and adjusted the project settings of the DataPreprocessing Branch
comment:8 Changed 10 years ago by ehopf
comment:9 Changed 10 years ago by ehopf
- Description modified (diff)
comment:10 Changed 10 years ago by ehopf
comment:11 Changed 10 years ago by ehopf
- Description modified (diff)
comment:12 Changed 10 years ago by ehopf
comment:13 Changed 10 years ago by ehopf
- Description modified (diff)
comment:14 Changed 10 years ago by ehopf
comment:15 Changed 10 years ago by ehopf
r12160: Branched HL.Data.Views to implement minor improvements to the StringConvertibleMatrixView-class.
comment:16 Changed 10 years ago by ehopf
r12161: Adjusted the project settings of the DataPreprocessing Branch.
comment:17 Changed 10 years ago by ehopf
r12164: Adjusted the project settings of the DataPreprocessing Branch.
comment:18 Changed 10 years ago by ehopf
r12165: Encapsulated sort column and statistics generation behavior in StringConvertibleMatrixView.cs. Additionally fixed the statistic measures regarding missing values.
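For illustration only, a minimal Python sketch of column statistics that treat missing values separately, which is roughly the behavior that Feature 3 and the statistics fix describe. It is not the HeuristicLab implementation: the function name `column_statistics` and the convention that missing cells are `None` or NaN are assumptions made for this example.

```python
import math


def column_statistics(values):
    """Hypothetical helper: summarize a numeric column, counting missing cells.

    Assumption for this sketch: a cell is "missing" if it is None or NaN.
    """
    present = [v for v in values if v is not None and not math.isnan(v)]
    stats = {
        "count": len(present),                  # cells that hold a value
        "missing": len(values) - len(present),  # cells without a value
        "min": None,
        "max": None,
        "mean": None,
    }
    if present:
        # Only the present values contribute to the measures, so missing
        # cells cannot distort them.
        stats["min"] = min(present)
        stats["max"] = max(present)
        stats["mean"] = sum(present) / len(present)
    return stats
```

Reporting the missing count next to the measures makes it visible how many cells each statistic is actually based on.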
comment:19 Changed 10 years ago by ehopf
r12167: Swapped the column left- and right-click behavior in the DataGrid view to be consistent with the row selection behavior.
comment:20 Changed 10 years ago by ehopf
comment:21 Changed 10 years ago by ehopf
comment:22 Changed 9 years ago by ehopf
- Description modified (diff)
comment:23 Changed 9 years ago by ehopf
comment:24 Changed 9 years ago by ehopf
comment:25 Changed 9 years ago by ehopf
comment:26 Changed 9 years ago by ehopf
- Description modified (diff)
comment:27 Changed 9 years ago by ehopf
- Description modified (diff)
comment:28 Changed 9 years ago by ehopf
comment:29 Changed 9 years ago by ehopf
comment:30 Changed 9 years ago by ehopf
comment:31 Changed 9 years ago by ehopf
- Description modified (diff)
comment:32 Changed 9 years ago by ehopf
r12555: Removed a redundant check and fixed a validation problem in the DataGrid view (Defect 9). Additionally changed the validation in the Manipulation view to disallow the thousands separator as input. This prevents the thousands separator from being unintentionally used as a decimal comma, which would produce a wrong value.
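A minimal Python sketch of this kind of validation, for illustration only. It is not the code from r12555: the function name `parse_strict_number` and its separator parameters are hypothetical, and the actual change lives in the C# Manipulation view.

```python
import re


def parse_strict_number(text, decimal_sep=".", thousands_sep=","):
    """Hypothetical helper: parse a number, rejecting the thousands separator.

    Assumption for this sketch: input consists of digits with at most one
    decimal separator; grouping characters are never accepted.
    """
    s = text.strip()
    if thousands_sep in s:
        # Accepting "1,5" under a "." decimal culture would silently turn
        # it into 15, so the input is rejected instead.
        raise ValueError(f"thousands separator {thousands_sep!r} is not allowed")
    pattern = r"[+-]?\d+(?:" + re.escape(decimal_sep) + r"\d+)?"
    if not re.fullmatch(pattern, s):
        raise ValueError(f"not a valid number: {text!r}")
    # Normalize to "." so float() can parse the value.
    return float(s.replace(decimal_sep, "."))
```

With the defaults, `parse_strict_number("1.5")` is accepted while `parse_strict_number("1,5")` raises a `ValueError`; with `decimal_sep=","` and `thousands_sep="."` the roles are reversed.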
comment:33 Changed 9 years ago by ehopf
r12633: Merged trunk into DataPreprocessingImprovements-Branch.
comment:34 Changed 9 years ago by ehopf
- Owner changed from ehopf to mkommend
- Status changed from accepted to reviewing
comment:35 Changed 9 years ago by mkommend
- Version changed from branch to 3.3.12
r12676: Merged changes into trunk.
comment:36 Changed 9 years ago by mkommend
r12677: Deleted DataPreprocessingImprovements branch.
comment:37 Changed 9 years ago by mkommend
- Owner changed from mkommend to gkronber
In my opinion the changes are ready for release.
comment:38 Changed 9 years ago by mkommend
r12682: Corrected accidentally merged project files.
comment:39 Changed 9 years ago by mkommend
r12683: Corrected accidentally merged project files (second try).
comment:40 Changed 9 years ago by gkronber
I didn't make a thorough review of all changes but I tested the data preprocessing view a little bit and nothing strange occurred. So I guess we can merge this to stable.
comment:41 Changed 9 years ago by gkronber
- Owner changed from gkronber to mkommend
- Status changed from reviewing to readytorelease
comment:42 Changed 9 years ago by mkommend
- Resolution set to done
- Status changed from readytorelease to closed
r12718: Merged all changes into stable.
r12051: Branched HL.DataPreprocessing to implement improvements