You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Predictive models with a lot of heterogeneity in the training data often include features that measure the dispersion/variation of each predictor. For example, in addition to a building's year built, you might also include the standard deviation of the year built, where the aggregation group is all properties within 1000 meters.
The text was updated successfully, but these errors were encountered:
Basically any continuous feature where we think the variance of the feature will have some effect on local values. Start with char_bldg_sf and char_yrblt. I'd expect both of those to be impactful here, as you can say things like "The average year built for this area is high and the variance is low, indicating this neighborhood is almost entirely new construction (and therefore likely worth more)."
Predictive models with a lot of heterogeneity in the training data often include features that measure the dispersion/variation of each predictor. For example, in addition to a building's year built, you might also include the standard deviation of the year built, where the aggregation group is all properties within 1000 meters.
The text was updated successfully, but these errors were encountered: