Problem
1. Describe Boltzmann Machines, including their relationship with Hopfield Networks, the probability distribution they implement, and the learning rule.
2. How can the learning rule be implemented?
3. Describe Boltzmann Machines with hidden units, and discuss how they differ from Boltzmann Machines with only visible units in expressive power and learning complexity.