Lab 6: Language Models

Download the files for Lab 6 from the following links:

We recommend that you use Google Colab, as training will be faster on the GPU.

To enable the GPU on Colab, go to Edit / Notebook settings / Hardware accelerator and select T4 GPU.


Instructions on how to download and use Jupyter Notebooks can be found here. You can find a static version of the notebook below.


Lab_9

  1. In honor of the best Miranda:

    If by your art, my dearest father, you have put the wild waters in this roar, allay them.

    We can, as we illustrate in Figure fig_language_is_a_time_series, parse this sentence as a time series. In this time series the first vector is $\mathbf{x}_0 = \text{“If”}$, the second vector is $\mathbf{x}_1 = \text{“by”}$, the third is $\mathbf{x}_2 = \text{“your”}$, and so on.
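Reading a sentence as a time series can be sketched in a few lines of Python. This is a minimal illustration, not part of the lab code: we simply split the sentence into tokens, so that token $t$ plays the role of $\mathbf{x}_t$.

```python
# Minimal sketch: a sentence as a time series of tokens.
sentence = ("If by your art, my dearest father, "
            "you have put the wild waters in this roar, allay them.")

# Strip punctuation and split on whitespace; tokens[t] plays the role of x_t.
tokens = sentence.replace(",", "").rstrip(".").split()
print(tokens[:3])  # ['If', 'by', 'your']
```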

    If we interpret language as a time series, we can use a transformer to predict the next word in a sequence, as we did in Chapter 5. If we then apply this predictor recursively, feeding each predicted word back in as input, we can generate several words in sequence. This is a strategy for generating language.
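The recursive generation loop can be sketched as follows. Here `predict_next` is a hypothetical stand-in for the trained transformer (a toy bigram lookup, not a real model); the point is only the loop structure, in which each prediction is appended to the sequence and fed back in.

```python
def predict_next(sequence):
    """Hypothetical stand-in for a trained next-word predictor."""
    # Toy bigram table; a real transformer would score the whole context.
    bigrams = {"If": "by", "by": "your", "your": "art"}
    return bigrams.get(sequence[-1], "<eos>")

def generate(prompt, n_words):
    """Recursively predict up to n_words, feeding each prediction back in."""
    seq = list(prompt)
    for _ in range(n_words):
        nxt = predict_next(seq)
        if nxt == "<eos>":  # stop when the model predicts end-of-sequence
            break
        seq.append(nxt)
    return seq

print(generate(["If"], 3))  # ['If', 'by', 'your', 'art']
```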

    The first challenge in implementing this strategy is representing words numerically. We do that with word embeddings, as we discuss in the following section.
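As a preview, the simplest numeric representation is a one-hot vector over a vocabulary; the word embeddings discussed next replace these sparse vectors with learned dense ones. This sketch uses a hypothetical toy vocabulary:

```python
# Minimal sketch: one-hot word vectors over a toy vocabulary (hypothetical).
vocab = ["If", "by", "your", "art"]

def one_hot(word):
    """Return a vector with a 1.0 at the word's vocabulary index."""
    v = [0.0] * len(vocab)
    v[vocab.index(word)] = 1.0
    return v

print(one_hot("by"))  # [0.0, 1.0, 0.0, 0.0]
```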