When modelling the properties of a system, you may find yourself needing to integrate over an impractically large set. For example, may map the status of all nodes in a system to a single “up” or “down” signal, and we aim to compute uptime by integrating over with respect to a measure .
To avoid this, we can find alternatives to and which allow us to compute uptime with a set of size rather than a set of size . The new functions and approximate the probability the system is “up” given “up” nodes and the probability that nodes are up.
Computationally, this should be a massive improvement, but this strategy doesn’t guarantee it’s “easy” to integrate with respect to . I recently encountered such a situation and found that Harris Inequality was a useful tool to get bounds on the property I was looking for. For non-decreasing functions, and :
Applying this to our setting, we obtain the following useful bounds. In the following, I’m assuming that is increasing on and decreasing on .
- For illustration’s sake (Desmos), let’s consider a model of uptime that we’ve approximated with the following equations (notice that both are increasing!). Do not worry where they come from, I am pulling them out of thin air. For a node system, the probability that nodes are up depends on some constant . To check the “uptime” conditional on or more nodes being up, we can apply the above result to our uptime function and probability measure.
- With all of this said, it’s unlikely that would be significantly less challenging for the computer than . I don’t really know, maybe one dot product is just too many.