Examine the linked Torch and PyTorch Im2Text implementations to understand the architecture. Study the OpenNMT tf codebase, particularly the existing modules like the encoder in onmt/modules, to determine where to add the image encoder. No existing comments or PRs provide additional context, so focus on replicating the approach within the TensorFlow framework.