Does anyone have some clear examples of tf.scatter_nd? The official example
does not seem clear to me. For context, I ran into it in the
_calc_final_dist function of
https://github.com/yaserkl/RLSeq2Seq/blob/master/code/attention_decoder.py
Official documentation: https://www.tensorflow.org/api_docs/python/tf/scatter_nd
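
One way to read tf.scatter_nd(indices, updates, shape): start from an all-zero tensor of the given shape, then add each entry of updates at the position named by the matching row of indices, so duplicate indices are summed. The sketch below models that in plain Python so it runs without TensorFlow. Note that scatter_nd_reference is a hypothetical helper written only for illustration (it is not part of TensorFlow) and it assumes indices is a flat list of index rows. The last example imitates a (batch, vocabulary id) scatter, which as far as I can tell is the pattern functions like _calc_final_dist rely on; that is my reading, not something confirmed from the source.

```python
def scatter_nd_reference(indices, updates, shape):
    """Plain-Python model of tf.scatter_nd (illustrative, hypothetical helper).

    indices: list of index rows, e.g. [[4], [3]] or [[0, 2], [1, 0]]
    updates: one scalar (or nested list, for slice updates) per index row
    shape:   shape of the zero-initialised output
    """
    def zeros(dims):
        # Build a nested list of zeros with the requested shape.
        if not dims:
            return 0
        return [zeros(dims[1:]) for _ in range(dims[0])]

    def add_into(target, value):
        # Element-wise, in-place add of nested list `value` into `target`.
        for i, v in enumerate(value):
            if isinstance(v, list):
                add_into(target[i], v)
            else:
                target[i] += v

    out = zeros(list(shape))
    for index, update in zip(indices, updates):
        # Walk down to the container that holds the addressed element/slice.
        target = out
        for i in index[:-1]:
            target = target[i]
        last = index[-1]
        if isinstance(update, list):
            add_into(target[last], update)   # update is a whole slice
        else:
            target[last] += update           # update is a single scalar
    return out


# 1) The example from the official docs: scatter 4 scalars into a length-8 vector.
#    TensorFlow call: tf.scatter_nd([[4], [3], [1], [7]], [9, 10, 11, 12], [8])
basic = scatter_nd_reference([[4], [3], [1], [7]], [9, 10, 11, 12], [8])
print(basic)       # [0, 11, 0, 10, 9, 0, 0, 12]

# 2) Duplicate indices are accumulated (summed), not overwritten.
summed = scatter_nd_reference([[1], [1], [3]], [1, 2, 5], [4])
print(summed)      # [0, 3, 0, 5]

# 3) Each index row is (batch, vocab_id): weights for the same word in the
#    same batch row are added together in a [batch_size, vocab_size] matrix.
projected = scatter_nd_reference(
    [[0, 2], [0, 2], [1, 0]],   # (batch 0, word 2) appears twice
    [0.5, 0.25, 1.0],
    [2, 4],
)
print(projected)   # [[0, 0, 0.75, 0], [1.0, 0, 0, 0]]
```

In real TensorFlow code the three calls would be written with tf.scatter_nd and the same arguments; positions that no index points at stay zero.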