In this problem, we explore the use of deterministic annealing for pattern classification using a neural network (Miller et al., 1996). The output of neuron j in the output layer is denoted by Fj(x), where x is the input vector. The classification decision is based on the maximum discriminant Fj(x)
(a) For a probabilistic objective function, consider the expression
![965_51349c7a-92b1-4644-9846-f6018d976ac6.png](https://secure.tutorsglobe.com/CMSImages/965_51349c7a-92b1-4644-9846-f6018d976ac6.png)
![1367_702f6e4e-e486-402d-abdc-af1fc43581ca.png](https://secure.tutorsglobe.com/CMSImages/1367_702f6e4e-e486-402d-abdc-af1fc43581ca.png)