METHOD OF GENERATING TRAINING DATA FOR TRAINING A NEURAL NETWORK, METHOD OF TRAINING A NEURAL NETWORK AND USING NEURAL NETWORK FOR AUTONOMOUS OPERATIONS
Abstract
Methods of generating training data for training a neural network, of training a neural network, and of using a neural network for autonomous operations are disclosed, together with related devices and systems. In one aspect, a neural network for autonomous operation of an object in an environment is trained. Policy values are generated based on a sample data set. An approximate action-value function is generated from the policy values. A set of approximated policy values is generated using the approximate action-value function for all states in the sample data set and for all possible actions. A training target for the neural network is calculated based on the approximated policy values. A training error is calculated as the difference between the training target and the policy value for the corresponding state-action pair in the sample data set. At least some of the parameters of the neural network are updated to minimize the training error.
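The sequence of steps in the abstract can be illustrated with a minimal sketch. It is one possible reading, not the patented implementation: a lookup table stands in for the neural network, the approximate action-value function is assumed to be a frozen copy of the current estimates, and the training target is assumed to be a standard one-step Q-learning (Bellman) target. The abstract specifies none of these choices. The names `SAMPLES`, `ACTIONS`, `GAMMA`, `LEARNING_RATE`, `TERMINAL_STATES` and `train_step` are hypothetical, as is the three-state toy environment.

```python
# Illustrative sketch only: a lookup table stands in for the neural network,
# and the Bellman-style target is an assumption, not taken from the abstract.

ACTIONS = [0, 1]        # hypothetical discrete action set
GAMMA = 0.9             # hypothetical discount factor
LEARNING_RATE = 0.5     # hypothetical step size

# Hypothetical sample data set of (state, action, reward, next_state) tuples
# for a three-state chain; state 2 is terminal.
SAMPLES = [
    (0, 1, 0.0, 1),
    (1, 1, 1.0, 2),
    (0, 0, 0.0, 0),
    (1, 0, 0.0, 0),
]
TERMINAL_STATES = {2}


def train_step(params):
    """Run one pass over the steps in the abstract; return the mean squared error."""
    # 1. Policy values for the state-action pairs in the sample data set.
    policy_values = [params.get((s, a), 0.0) for s, a, _, _ in SAMPLES]

    # 2. Approximate action-value function: here a frozen copy of the current
    #    estimates (an assumption; the source does not say how it is fitted).
    frozen = dict(params)

    def approx_q(state, action):
        return frozen.get((state, action), 0.0)

    # 3. Approximated policy values for all states in the data set, all actions.
    states = {s for s, _, _, _ in SAMPLES} | {s2 for _, _, _, s2 in SAMPLES}
    approx_values = {(s, a): approx_q(s, a) for s in states for a in ACTIONS}

    squared_errors = []
    for (s, a, r, s2), value in zip(SAMPLES, policy_values):
        # 4. Training target (assumed one-step Bellman target).
        if s2 in TERMINAL_STATES:
            future = 0.0
        else:
            future = max(approx_values[(s2, b)] for b in ACTIONS)
        target = r + GAMMA * future
        # 5. Training error: target minus the policy value of the sampled pair.
        error = target - value
        # 6. Update the parameter for this pair to reduce the error.
        params[(s, a)] = value + LEARNING_RATE * error
        squared_errors.append(error ** 2)
    return sum(squared_errors) / len(squared_errors)


params = {}
errors = [train_step(params) for _ in range(200)]
```

In this toy setting the table entries settle toward the discounted returns of the chain and the mean squared training error shrinks across passes. With an actual neural network, step 6 would instead be a gradient step on the network parameters.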