Hello everybody,
I am trying to estimate consumption data, by applying three methods (only one or a combination). I need to estimate only a max between 20% of the total period for which actual data is reported (for the two whole years that I am interested in 2020-2021) and 93 days. We can have consumption outside the active period for one meter (it’s historical data), but I estimate only the active period, within the two years. The methods are:
- If data missing for one month is 15 days or less - estimated consumption will be equal to actual consumption/number of days for which we have consumption*missing number of days.
- If we have missing for example four months' worth of data for 2020 (Jan-April), but we have full data for 2019, then we compare the consumption for May-Dec 2020 with 2019 and we apply the increase/decrease percentage for 2019 Jan-April data to get the estimated one, but limit to 93 days or 20%.
- Average monthly consumption with no previous year: we get the total actual consumption/days for which we have data*missing number of days, but limit to 93 days or 20%.
I managed to apply the first method by generating the days we have consumption and the missing days, but I struggle with the other two methods. I have attached the flow with dummy data.
I hope that somebody can help me with a solution or to pinpoint in a direction.