Update the target networks:
\(\theta^{Q^{\prime}}=\beta \theta^{Q}+(1-\beta) \theta^{Q^{\prime}}\)
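
The update above can be read as an assignment: each target parameter moves a fraction \(\beta\) of the way toward the corresponding online parameter. The following is a minimal sketch of that rule in plain Python. The function name `soft_update`, the representation of parameters as a dict of float lists, and the toy values are assumptions made for illustration and do not come from the source.

```python
def soft_update(target_params, online_params, beta):
    """Return beta * online + (1 - beta) * target for every named parameter.

    Both arguments are hypothetical stand-ins for network weights:
    dicts mapping a parameter name to a flat list of floats.
    """
    if not 0.0 <= beta <= 1.0:
        raise ValueError("beta must lie in [0, 1]")
    return {
        name: [
            beta * w + (1.0 - beta) * w_target
            for w, w_target in zip(online_params[name], target_params[name])
        ]
        for name in target_params
    }


# Toy parameters (illustrative values only).
online = {"w": [1.0, 2.0], "b": [0.5]}
target = {"w": [0.0, 0.0], "b": [0.0]}

# One soft update with an illustrative beta of 0.5.
target = soft_update(target, online, beta=0.5)
```

With \(\beta = 1\) the target becomes an exact copy of the online parameters, and with \(\beta = 0\) it never changes. Intermediate values make the target a slowly moving average of the online parameters.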