
Subtracting mean embeddings #2

Open
bwang482 opened this issue Oct 30, 2019 · 2 comments


bwang482 commented Oct 30, 2019

Are you sure this line is correct?
X_train = X_train - np.mean(X_train)

np.mean(X_train) gives a single value. Shouldn't it be np.mean(X_train, 0)?
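
For anyone comparing the two calls, here is a minimal sketch of the difference (the shapes are illustrative, not taken from this repo):

import numpy as np

X_train = np.random.rand(5, 300)    # toy matrix: 5 word vectors, 300 dims
scalar_mean = np.mean(X_train)      # a single number: mean over all entries
vector_mean = np.mean(X_train, 0)   # shape (300,): per-dimension mean
print(np.shape(scalar_mean))        # () -- a scalar
print(vector_mean.shape)            # (300,)
shifted = X_train - scalar_mean     # shifts every entry by the same value
centered = X_train - vector_mean    # zero-centers each embedding dimension

Both subtractions broadcast without error, so the choice of axis silently changes what the preprocessing does.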

@GuilhermeZaniniMoreira

I am getting this error:
ValueError: operands could not be broadcast together with shapes (237191,) (300,)
There are 237,191 words with an embedding dimension of 300. How did you solve that?
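
That error suggests X_train is not a 2-D (237191, 300) matrix but a 1-D array of length 237191 (e.g. an object array holding one vector per word), so it cannot broadcast against the (300,)-dimensional mean. A hedged sketch of one possible fix, with stand-in data since the loading code isn't shown here:

import numpy as np

vectors = [np.random.rand(300) for _ in range(1000)]  # stand-in vocabulary
X_train = np.stack(vectors)              # proper 2-D matrix, here (1000, 300)
X_train = X_train - np.mean(X_train, 0)  # (300,) mean broadcasts over rows

np.stack forces the per-word vectors into one contiguous 2-D matrix, after which the axis-0 mean broadcasts row by row.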


iR00i commented Sep 1, 2021

Shouldn't it be np.mean(X_train, 0)?

If you go back to the original paper that proposed the "Post-Processing Algorithm" (All-but-the-Top: Simple and Effective Postprocessing for Word Representations), the authors describe computing the mean as follows:

[images from the paper's Algorithm 1: mu <- (1/|V|) * sum_{w in V} v(w), then v~(w) <- v(w) - mu]

So I imagine the resulting mean should be a scalar computed from the entire matrix.
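
For completeness, here is a minimal sketch of the whole post-processing step as I read the paper (mean removal followed by removing the top-D principal components); this is my own reading, not necessarily this repo's code, and sklearn is assumed for the PCA:

import numpy as np
from sklearn.decomposition import PCA

def all_but_the_top(X, D=3):
    # X: (vocab_size, dim) embedding matrix; D: number of directions to drop
    mu = np.mean(X, axis=0)          # mean word vector, shape (dim,)
    X_tilde = X - mu                 # v~(w) = v(w) - mu
    U = PCA(n_components=D).fit(X_tilde).components_  # (D, dim)
    return X_tilde - X_tilde @ U.T @ U  # remove top-D projections

Under this reading, mu is the average word vector, i.e. the axis-0 mean.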
