'How BERT tokenizer works for subwords?

For WordPiece tokenizer, we split the tokens like playing to play and ##ing. And we get embedding for play and ##ing separately as the representation of playing. Now I want only one representation of playing. To do this, I can do elementwise multiplication or addition of play and ##ing. But which is correct or is there any way to do this?

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution	Source

'How BERT tokenizer works for subwords?

Sources

Related Questions