Opened 5 months ago

Last modified 3 months ago

#2697 accepted enhancement

Refactor certain parts for data-based modeling

Reported by: gkronber
Owned by: gkronber
Priority: medium
Milestone: HeuristicLab 3.3.15
Component: Problems.DataAnalysis
Version: 3.3.14
Keywords:
Cc:

Description

  • Move translation to AutoDiff out of evaluator
  • Create ISymbolicExpressionTree for linear model from vector of coefficients
  • Create two-dimensional array from ProblemData

Change History (11)

comment:1 Changed 5 months ago by gkronber

  • Owner set to gkronber
  • Status changed from new to accepted

comment:2 Changed 5 months ago by gkronber

r14378:

  • created a folder for all classes related to transformation from and to trees
  • created a transformator which takes a tree and uses AutoDiff to produce a function and gradient function for the tree.
  • moved code from SymbolicRegressionConstantOptimizationEvaluator to TreeToAutoDiffTermTransformator to make AutoDiff for trees more accessible
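
The idea behind this converter, turning an expression tree into a function and a gradient function, can be illustrated without HeuristicLab or AutoDiff. The Python below is only a sketch under assumptions: the tuple-based tree encoding and the name `tree_to_func_and_grad` are hypothetical and do not mirror TreeToAutoDiffTermTransformator, and it uses a hand-written forward-mode derivative instead of the AutoDiff library.

```python
import math

# Hypothetical tree encoding (NOT HeuristicLab's ISymbolicExpressionTree):
#   ("const", i)   -> i-th optimizable constant
#   ("var", name)  -> input variable, looked up by name in a row dict
#   ("add", a, b), ("mul", a, b), ("exp", a)

def _eval(node, consts, row):
    """Return (value, gradient with respect to consts) for one data row."""
    n = len(consts)
    kind = node[0]
    if kind == "const":
        grad = [0.0] * n
        grad[node[1]] = 1.0          # d c_i / d c_i = 1
        return consts[node[1]], grad
    if kind == "var":
        return row[node[1]], [0.0] * n   # inputs do not depend on constants
    if kind == "add":
        a, ga = _eval(node[1], consts, row)
        b, gb = _eval(node[2], consts, row)
        return a + b, [x + y for x, y in zip(ga, gb)]
    if kind == "mul":
        a, ga = _eval(node[1], consts, row)
        b, gb = _eval(node[2], consts, row)
        # product rule: d(a*b) = da*b + a*db
        return a * b, [x * b + a * y for x, y in zip(ga, gb)]
    if kind == "exp":
        a, ga = _eval(node[1], consts, row)
        e = math.exp(a)
        return e, [e * x for x in ga]    # chain rule
    raise ValueError(f"unsupported node type: {kind!r}")

def tree_to_func_and_grad(tree):
    """Return (func, grad); both take (consts, row)."""
    def func(consts, row):
        return _eval(tree, consts, row)[0]
    def grad(consts, row):
        return _eval(tree, consts, row)[1]
    return func, grad

# Example tree for c0 * x + c1
tree = ("add", ("mul", ("const", 0), ("var", "x")), ("const", 1))
f, g = tree_to_func_and_grad(tree)
```

With this sketch, `f([2.0, 1.0], {"x": 3.0})` evaluates the model and `g([2.0, 1.0], {"x": 3.0})` returns the partial derivatives with respect to the two constants, which is the pair of callables a constant optimizer needs.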

comment:3 Changed 4 months ago by gkronber

r14390:

  • renamed folder "Transformation" to "Converters" to distinguish between transformations of variables (from data preprocessing) and classes for the transformation of trees
  • renamed SymbolicDataAnalysisExpressionTreeSimplifier -> TreeSimplifier
  • implemented a converter to create a linear model as a symbolic expression tree

comment:4 Changed 4 months ago by gkronber

r14391:

  • extended converter for linear models to support lagged variables and changed AR(k) to use this method
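
A rough picture of what such a converter does, assuming a simple nested-tuple tree rather than HeuristicLab's tree classes: each coefficient becomes a "constant times (possibly lagged) variable" node, and the terms are summed together with the intercept. The function names, the tuple encoding, and the convention that a lag of -1 means "previous time step" are all assumptions made for this sketch.

```python
def linear_model_tree(names, coefficients, intercept=0.0, lags=None):
    """Build intercept + sum_i coefficients[i] * names[i](t + lags[i]) as a nested tuple."""
    if len(names) != len(coefficients):
        raise ValueError("names and coefficients must have the same length")
    if lags is None:
        lags = [0] * len(names)      # no lag: plain linear model
    if len(lags) != len(names):
        raise ValueError("lags must have one entry per variable")
    tree = ("const", intercept)
    for name, coeff, lag in zip(names, coefficients, lags):
        term = ("mul", ("const", coeff), ("var", name, lag))
        tree = ("add", tree, term)
    return tree

def evaluate(tree, data, t):
    """Evaluate the tree at time index t; data maps variable names to lists."""
    kind = tree[0]
    if kind == "const":
        return tree[1]
    if kind == "var":
        idx = t + tree[2]
        if idx < 0:
            # avoid Python's silent wrap-around for negative indices
            raise IndexError("lag reaches before the start of the series")
        return data[tree[1]][idx]
    if kind == "add":
        return evaluate(tree[1], data, t) + evaluate(tree[2], data, t)
    if kind == "mul":
        return evaluate(tree[1], data, t) * evaluate(tree[2], data, t)
    raise ValueError(f"unsupported node type: {kind!r}")
```

An AR(2) model y(t) = 1.0 + 0.5 y(t-1) + 0.25 y(t-2) would then be `linear_model_tree(["y", "y"], [0.5, 0.25], intercept=1.0, lags=[-1, -2])`, which is the sense in which an AR(k) model can reuse the same converter as an ordinary linear model.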

comment:5 Changed 4 months ago by gkronber

Todo: remove the Scaling class (needs backwards-compatible persistence in GPR)

comment:6 Changed 4 months ago by gkronber

r14393:

  • removed AlglibUtil.cs and added an extension method .ToArray(names, rows) to IDataset instead.
  • refactored transformation so that it is possible to apply a transformation without resetting the parameters
  • Used transformations instead of Scaling as far as possible.
  • Moved TakeEvery extension method to HL.Common
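
To make the two helpers mentioned above concrete, here is a small Python sketch of what a `ToArray(names, rows)`-style method and a `TakeEvery`-style method might do. It assumes a dataset is a dict from variable name to a list of values, and that "take every n-th" starts with the first element; neither assumption is taken from the HeuristicLab sources, and the function names are illustrative only.

```python
from itertools import islice

def to_array(dataset, names, rows):
    """Return a row-major 2D list: one row per index in rows, one column per name."""
    rows = list(rows)                # allow any iterable of row indices
    return [[dataset[name][r] for name in names] for r in rows]

def take_every(iterable, n):
    """Lazily yield every n-th element, starting with the first (assumed semantics)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return islice(iterable, 0, None, n)
```

For example, `to_array(ds, ["y", "x"], take_every(range(100), 10))` would build a matrix from every tenth row with columns in the requested order, the kind of input matrix a numerical routine typically expects.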

comment:7 Changed 4 months ago by gkronber

r14394: renamed TreeSimplifier in unit test

comment:8 Changed 4 months ago by gkronber

r14396: added methods to get training and test input matrices from ProblemData

comment:9 Changed 4 months ago by gkronber

r14400: reverse merge of r14378, r14390, r14391, r14393, r14394, r14396 because the changes cannot be merged over to the #2650 branch

comment:10 Changed 4 months ago by gkronber

This is needed by #745.

However, #2650 should be merged to the trunk first before we apply the changes reverted in r14400 because there are many conflicting changes.

comment:11 Changed 3 months ago by mkommend

r14507: Added caching of parameters and removed resetting of training and test partition in AdjustProblemDataProperties (used for changing the problem data of a solution) in DataAnalysisProblemData.
