The Wayback Machine - https://web.archive.org/web/20201111073056/https://github.com/tensorflow/hub/issues/344

Where should I get the detailed information of the embeddings of the USE model #344

Closed
yoheikikuta opened this issue Aug 3, 2019 · 2 comments

@yoheikikuta yoheikikuta commented Aug 3, 2019

Hi.

I want to understand the embeddings of the USE (Universal Sentence Encoder) model in detail; where can I find this information?

For example, ELMo's embeddings are described at https://tfhub.dev/google/elmo/2.
In the case of USE, however, https://tfhub.dev/google/universal-sentence-encoder/2 only says that the output is a 512-dimensional vector.
How is that output computed?

I found that the output corresponds to the tensor <tf.Tensor 'module_apply_default/Encoder_en/hidden_layers/l2_normalize:0' shape=(?, 512) dtype=float32> in the model's graph, but it is not easy to identify exactly what value is computed.

I read the USE paper, so I can guess the output is something like Σ_w Embed(w) / √(sentence length). But I'm not sure which layer is used as the embeddings: the last layer of the Transformer encoder? The first embedding-lookup layer? Something else?
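The guessed formula above can be sketched in NumPy (a toy illustration of the paper's formula as I read it, not the module's actual code):

```python
import numpy as np

def guessed_sentence_embedding(token_embeddings):
    """Hypothetical reading of the USE paper's formula:
    sum the token embeddings and divide by sqrt(sentence length)."""
    token_embeddings = np.asarray(token_embeddings, dtype=np.float32)
    n_tokens = token_embeddings.shape[0]
    return token_embeddings.sum(axis=0) / np.sqrt(n_tokens)

# Two 4-dimensional token embeddings as toy input.
tokens = [[1.0, 0.0, 2.0, 0.0],
          [1.0, 2.0, 0.0, 0.0]]
print(guessed_sentence_embedding(tokens))  # elementwise sum divided by sqrt(2)
```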

Thanks.

@yoheikikuta yoheikikuta (Author) commented Aug 12, 2019

I misunderstood the model and wrote incorrect information in the description above.
There are several different USE models on TensorFlow Hub.

To clarify the differences, I inspected the graph visualizations of two USE models (universal-sentence-encoder-large/3 and universal-sentence-encoder/2) in TensorBoard.

https://tfhub.dev/google/universal-sentence-encoder-large/3

The model's default output is <tf.Tensor 'module_apply_default/Encoder_en/hidden_layers/l2_normalize:0' shape=(2, 512) dtype=float32>.
This can be confirmed in the graph visualization.

https://tfhub.dev/google/universal-sentence-encoder/2

The model's default output is <tf.Tensor 'module_apply_default/Encoder_en/hidden_layers/l2_normalize:0' shape=(?, 512) dtype=float32>.
This model is based on a Deep Averaging Network (DAN), so the DAN output is fed into the hidden_layers ops, as the graph visualization shows.

Although I now mostly understand the models, it's difficult to trace the detailed computation in the graphs (e.g., what is the concrete expression for the kernel defined in tanh_layer_0 in universal-sentence-encoder-large/3?).
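For what it's worth, a "tanh_layer" node in such graphs is typically a fully connected layer with a tanh activation; this is an assumption about the graph structure, not confirmed from source code. A minimal NumPy sketch of that assumed form:

```python
import numpy as np

def tanh_layer(x, kernel, bias):
    # Assumed form of a "tanh_layer": a dense layer with tanh activation,
    # y = tanh(x @ kernel + bias). The actual kernel values are stored as
    # variables inside the module, not in any released source code.
    return np.tanh(x @ kernel + bias)

# With an identity kernel and zero bias, the layer reduces to tanh(x).
x = np.array([[0.5, -0.5]])
print(tanh_layer(x, np.eye(2), np.zeros(2)))
```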

Is the TensorFlow code that defines these models available, so that we can understand them in detail?

@andresusanopinto andresusanopinto (Collaborator) commented Jul 27, 2020

The code that defines the models is not available; only the modules on TensorFlow Hub are.

"https://tfhub.dev/google/universal-sentence-encoder/2" first computes Σ_w Embed(w) / √(sentence length) and feeds it into several DNN layers; the output of the last DNN layer is used as the output embedding.
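The DAN pipeline described above can be sketched as follows (a toy NumPy illustration with assumed layer shapes and tanh activations; the real module's weights and exact architecture are not public):

```python
import numpy as np

rng = np.random.default_rng(0)

def dan_embedding(token_embeddings, kernels, biases):
    """Sketch of the described DAN variant: average the token embeddings
    scaled by sqrt(sentence length), pass the result through a stack of
    dense tanh layers, and L2-normalize the last layer's output
    (matching the module's l2_normalize:0 output tensor)."""
    x = np.sum(token_embeddings, axis=0) / np.sqrt(len(token_embeddings))
    for W, b in zip(kernels, biases):
        x = np.tanh(x @ W + b)
    return x / np.linalg.norm(x)

# Toy input: 5 tokens with embedding dim 8, two hypothetical hidden layers.
tokens = rng.normal(size=(5, 8)).astype(np.float32)
kernels = [rng.normal(size=(8, 8)), rng.normal(size=(8, 4))]
biases = [np.zeros(8), np.zeros(4)]
emb = dan_embedding(tokens, kernels, biases)
print(emb.shape)  # (4,)
```

The final L2 normalization is why the module's output tensor is named `l2_normalize:0`.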

"https://tfhub.dev/google/universal-sentence-encoder-large/3" has a Transformer encoder and uses average pooling over all token embeddings at the last Transformer layer as the output embedding.
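The pooling step for the large model can be sketched like this (a NumPy toy, with the L2 normalization added on the assumption that it matches the module's `l2_normalize:0` output; shapes are illustrative):

```python
import numpy as np

def average_pool(last_layer_states):
    """Average pooling over token positions at the last Transformer
    layer, followed by L2 normalization (assumed to match the module's
    l2_normalize:0 output tensor)."""
    pooled = np.asarray(last_layer_states).mean(axis=0)
    return pooled / np.linalg.norm(pooled)

# Toy input: 3 tokens with embedding dim 4.
states = np.array([[1.0, 0.0, 0.0, 0.0],
                   [0.0, 1.0, 0.0, 0.0],
                   [0.0, 0.0, 1.0, 0.0]])
print(average_pool(states))
```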
