I started writing the page folk model structure on Cat, with an explicit summary of the construction and a description of how it can be modified to work if you assume only COSHEP. I feel like there should also be a "dual" model structure, assuming some other weakening of choice, in which all categories are cofibrant and the fibrant objects are the "stacks", but I haven't yet been able to make it come out right.
