A Structural Probe for Finding Syntax in Word Representations

John Hewitt, Christopher D. Manning

NAACL 2019, Stanford

**What's new:** This paper shows that a learned linear transformation of the vector space of BERT and ELMo embeddings recovers linguistic structure, namely the dependency parse tree.

**How is it done**

* A linear transformation $B$ is learned that captures the linguistic structure; a minimal sketch of both probes appears after this list.

* Model 1 (distance probe): the distance between two words in the dependency tree is approximated by the squared distance between two vector representations obtained by a linear transformation of the embeddings:

  $$d_{B}\left(\mathbf{h}_{i}^{\ell}, \mathbf{h}_{j}^{\ell}\right)^{2}=\left(B\left(\mathbf{h}_{i}^{\ell}-\mathbf{h}_{j}^{\ell}\right)\right)^{T}\left(B\left(\mathbf{h}_{i}^{\ell}-\mathbf{h}_{j}^{\ell}\right)\right)$$

  $$\min _{B} \sum_{\ell} \frac{1}{\left|s^{\ell}\right|^{2}} \sum_{i, j}\left|d_{T^{\ell}}\left(w_{i}^{\ell}, w_{j}^{\ell}\right)-d_{B}\left(\mathbf{h}_{i}^{\ell}, \mathbf{h}_{j}^{\ell}\right)^{2}\right|$$
* Model 2 (depth probe): a word's depth in the dependency tree is predicted as the squared norm of the linearly transformed embedding, where $\left\|w_{i}^{\ell}\right\|$ denotes the depth of word $w_{i}^{\ell}$ in the parse tree:

  $$\min _{B} \sum_{\ell} \frac{1}{\left|s^{\ell}\right|} \sum_{i}\left|\left\|w_{i}^{\ell}\right\|-\left\|B \mathbf{h}_{i}^{\ell}\right\|^{2}\right|$$
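To make the two objectives concrete, here is a minimal PyTorch sketch of both probes (the class and function names are mine, not from the authors' released code; the training loop and optimizer are omitted). Only the matrix $B$ is learned; the embeddings stay frozen.

```python
import torch

class StructuralProbe(torch.nn.Module):
    """Sketch of both probes. B is a learned k x d matrix, where k is
    the probe rank discussed in the insights below."""

    def __init__(self, model_dim, probe_rank):
        super().__init__()
        self.B = torch.nn.Parameter(0.01 * torch.randn(probe_rank, model_dim))

    def squared_distances(self, h):
        # h: (seq_len, model_dim) hidden states for one sentence.
        # Returns d_B(h_i, h_j)^2 = ||B(h_i - h_j)||^2 for all pairs (i, j).
        t = h @ self.B.T                        # (seq_len, probe_rank)
        diff = t.unsqueeze(1) - t.unsqueeze(0)  # (seq_len, seq_len, rank)
        return (diff ** 2).sum(dim=-1)

    def squared_norms(self, h):
        # Returns ||B h_i||^2, the depth probe's prediction for each word.
        return ((h @ self.B.T) ** 2).sum(dim=-1)

def distance_loss(probe, h, gold_distances):
    # L1 between gold tree distances and predicted squared distances,
    # normalized by |s|^2 as in the first objective above.
    n = h.size(0)
    return (gold_distances - probe.squared_distances(h)).abs().sum() / n**2

def depth_loss(probe, h, gold_depths):
    # L1 between gold tree depths and predicted squared norms,
    # normalized by sentence length |s| as in the second objective.
    n = h.size(0)
    return (gold_depths - probe.squared_norms(h)).abs().sum() / n
```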

**What are major insights**

* The layer-wise probing figure shows a UUAS score of around 80, peaking at a hidden layer index of about 8 in the 12-layer BERT and about 18 in the 24-layer BERT (a sketch of the tree decoding behind UUAS appears at the end of these notes).


* As the paper's depth figure shows, the predicted dependency tree depths correlate highly with the gold tree depths.


* The linear transformation projects BERT and ELMo vectors into a space that reflects linguistic structure. Varying the rank of the projection shows that, beyond 64 dimensions, increasing dimensionality does not help much.

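The UUAS numbers above come from decoding each sentence's predicted distance matrix into a tree and comparing its undirected edges against the gold parse. A minimal sketch, assuming SciPy and a gold edge set precomputed from the treebank (the helper name and edge representation are my own):

```python
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree

def uuas(pred_sq_distances, gold_edges):
    """UUAS for one sentence. `pred_sq_distances` is the (n, n) matrix of
    d_B(h_i, h_j)^2 from the probe; `gold_edges` is a set of
    frozenset({i, j}) undirected edges of the gold dependency tree."""
    # The minimum spanning tree of the predicted distance matrix is the
    # probe's best undirected parse (the paper also decodes with an MST).
    mst = minimum_spanning_tree(np.asarray(pred_sq_distances))
    rows, cols = mst.nonzero()
    pred_edges = {frozenset((int(i), int(j))) for i, j in zip(rows, cols)}
    return len(pred_edges & gold_edges) / len(gold_edges)
```

Corpus-level UUAS would then average this over all sentences.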