Opened 11 years ago
Closed 11 years ago
#2335 closed enhancement (done)
Data Preprocessing Improvements
| Reported by: | ehopf | Owned by: | mkommend |
|---|---|---|---|
| Priority: | medium | Milestone: | HeuristicLab 3.3.12 |
| Component: | DataPreprocessing | Version: | 3.3.12 |
| Keywords: | Cc: |
Description (last modified by ehopf)
Various data preprocessing improvements especially for data sets with many missing values. Furthermore this includes fixes for found defects in the data preprocessing components.
Defects
(1) Problem data generation fails with an Exception, if a user deletes one or more columns. (2) The generated classification problem data doesn´t keep the user defined positive class. (3) Editing a cell in the DataGrid-View leads to an Exception. (4) Exception if a user changes to the Histogram-View after he deletes a column (only occurs if the user was already in the Histogram-View before the deletion). (5) Manipulation-View: Wrong column count in the preview of "Delete Columns with insufficient Information" and "Delete Columns with insufficient Variance". (6) The Statistics-View doesn´t update the statistics if a Filter is applied to the data. (7) Deactive Export Problem Button. (8) Unexptected row delete behavior if the data is sorted (Datagrid-View). (9) An Exception occurs if the user deletes more than half of the rows in the DataGrid-View.
Features
(1) An option to display missing values within the Histogram view. (2) A possibility to delete columns in the DataGrid view. (3) Display Missing value information in the statistics available in the DataGrid view.
Change History (42)
comment:1 Changed 11 years ago by ehopf
- Version changed from 3.3.11 to branch
comment:2 Changed 11 years ago by ehopf
- Status changed from new to accepted
comment:3 Changed 11 years ago by ehopf
comment:4 Changed 11 years ago by ehopf
r12052: created the HL.DataPreprocessing project subfolder in the DataPreprocessing branch
comment:5 Changed 11 years ago by ehopf
r12053: moved the HL.DataPreprocessing project to the appropriate subfolder
comment:6 Changed 11 years ago by ehopf
r12054: Branched HeuristicLab.DataPreprocessing.Views to implement the improvements
comment:7 Changed 11 years ago by ehopf
r12056: Created a solution file and adjusted the project settings of the DataPreprocessing Branch
comment:8 Changed 11 years ago by ehopf
comment:9 Changed 11 years ago by ehopf
- Description modified (diff)
comment:10 Changed 11 years ago by ehopf
comment:11 Changed 11 years ago by ehopf
- Description modified (diff)
comment:12 Changed 11 years ago by ehopf
comment:13 Changed 11 years ago by ehopf
- Description modified (diff)
comment:14 Changed 11 years ago by ehopf
comment:15 Changed 11 years ago by ehopf
r12160: Branched HL.Data.Views to implement minor improvements to the StringConvertibleMatrixView-class.
comment:16 Changed 11 years ago by ehopf
r12161: Adjusted the project settings of the DataPreprocessing Branch.
comment:17 Changed 11 years ago by ehopf
r12164: Adjusted the project settings of the DataPreprocessing Branch.
comment:18 Changed 11 years ago by ehopf
r12165: Encapsulated sort column and statistics generation behavior in StringConvertibleMatrixView.cs. Additionally fixed the statistic measures regarding missing values.
comment:19 Changed 11 years ago by ehopf
r12167: Swapped column left- and rightclick behavior in the DataGrid view, to be consistent with the row selection behavior.
comment:20 Changed 11 years ago by ehopf
comment:21 Changed 11 years ago by ehopf
comment:22 Changed 11 years ago by ehopf
- Description modified (diff)
comment:23 Changed 11 years ago by ehopf
comment:24 Changed 11 years ago by ehopf
comment:25 Changed 11 years ago by ehopf
comment:26 Changed 11 years ago by ehopf
- Description modified (diff)
comment:27 Changed 11 years ago by ehopf
- Description modified (diff)
comment:28 Changed 11 years ago by ehopf
comment:29 Changed 11 years ago by ehopf
comment:30 Changed 11 years ago by ehopf
comment:31 Changed 11 years ago by ehopf
- Description modified (diff)
comment:32 Changed 11 years ago by ehopf
r12555: Removed a redundant check and fixed a validation problem in the Datagrid-View (Defect 9). Additionally changed the validation in the Manipulation-View to disallow the thousands separator as input. This prevents the unintended usage of the thousands separator as comma which would result in a wrong result.
comment:33 Changed 11 years ago by ehopf
r12633: Merged trunk into DataPreprocessingImprovements-Branch.
comment:34 Changed 11 years ago by ehopf
- Owner changed from ehopf to mkommend
- Status changed from accepted to reviewing
comment:35 Changed 11 years ago by mkommend
- Version changed from branch to 3.3.12
r12676: Merged changes into trunk.
comment:36 Changed 11 years ago by mkommend
r12677: Deleted DataPreprocessingImprovements branch.
comment:37 Changed 11 years ago by mkommend
- Owner changed from mkommend to gkronber
In my opinion the changes are ready for release.
comment:38 Changed 11 years ago by mkommend
r12682: Corrected accidentally merged project files.
comment:39 Changed 11 years ago by mkommend
r12683: Corrected accidentally merged project files (second try).
comment:40 Changed 11 years ago by gkronber
I didn't make a thorough review of all changes but I tested the data preprocessing view a little bit and nothing strange occurred. So I guess we can merge this to stable.
comment:41 Changed 11 years ago by gkronber
- Owner changed from gkronber to mkommend
- Status changed from reviewing to readytorelease
comment:42 Changed 11 years ago by mkommend
- Resolution set to done
- Status changed from readytorelease to closed
r12718: Merged all changes into stable.



r12051: Branched HL.DataPreprocessing to implement improvements