Graph Neural Thompson Sampling
Thompson Sampling, a multi-armed bandit solution strategy is extended into the realm of contextual bandits using graph neural networks. The objective is to do resource allocation in network diffusion processes like epidemics and opinion dynamics. Currently, testing kit allocation experiments for epidemic control are done.