Use simulations to investigate the effect of bin choices when doing goodness of fit testing. Using a variety of Poisson, normal, and exponential distributions (add others if you like) compute the goodness of fit p-values from simulated data sets using different numbers or sizes of cells and produce a scatterplot or other type of summary to compare the results. Based on your experimental results, how much of a concern is this?