File:T-SNE visualisation of word embeddings generated using 19th century literature.pdf
Original file (1,658 × 1,125 pixels, file size: 963 KB, MIME type: application/pdf)
Captions
This file has been superseded by T-SNE visualisation of word embeddings generated using 19th century literature.png. It is recommended to use the other file. Please note that deleting superseded images requires consent. Reason to use the other file: "A PNG version of this file is now available."
|
Summary
[edit]DescriptionT-SNE visualisation of word embeddings generated using 19th century literature.pdf |
English: Word embedding algorithms derive a set of real-valued vectors representing the vocabulary of a text corpus in a new embedded space. This provides a useful means of measuring the underlying similarity between words.
This image consists of word embeddings generated from 19th century literature. Gender-encoded unigrams, such as ‘she’ and ‘he’, by female authors are depicted as large, pink circles while the corresponding male authored unigrams are depicted as large, grey circles. Gender-encoded embeddings occupy four different spaces within this embeddings projection annotated A-D. A: Female- and male-authored plural nouns {fellows, women, men,..} surrounded by past-participles verbs. No family related nouns such as {daughters, sisters, brothers} by female authors despite presence of male-authored counterparts. B: Singular gender-encoded nouns by both female and male authors nested within nouns referring to (typically male) occupations {priest, clerk, magistrate, farmer,..}. All male-authored pronouns but only one female authored pronoun, "himself". C: Family related nouns (singular and plural) by only female authors, nested within a cluster of characters predominately from Jane Austen’s novels. D: Female authored pronouns next to past-participles and past verbs. Provides interesting counterpoint to Argamon et al. [1] who found differences in how women and men use words particularly personal pronouns. [1] Argamon, S., Koppel, M., Fine, J., Shimoni, A.R.: Gender, genre, and writing style in formal written texts. TEXT 23, 321–346 (2003)English: Word embedding algorithms derive a set of real-valued vectors representing the vocabulary of a text corpus in a new embedded space. This provides a useful means of measuring the underlying similarity between words. This image consists of word embeddings generated from 19th century literature. Gender-encoded unigrams, such as ‘she’ and ‘he’, by female authors are depicted as large, pink circles while the corresponding male authored unigrams are depicted as large, grey circles. Gender-encoded embeddings occupy four different spaces within this embeddings projection annotated A-D. A: Female- and male-authored plural nouns {fellows, women, men,..} surrounded by past-participles verbs. No family related nouns such as {daughters, sisters, brothers} by female authors despite presence of male-authored counterparts. B: Singular gender-encoded nouns by both female and male authors nested within nouns referring to (typically male) occupations {priest, clerk, magistrate, farmer,..}. All male-authored pronouns but only one female authored pronoun, "himself". C: Family related nouns (singular and plural) by only female authors, nested within a cluster of characters predominately from Jane Austen’s novels. D: Female authored pronouns next to past-participles and past verbs. Provides interesting counterpoint to Argamon et al. [1] who found differences in how women and men use words particularly personal pronouns. [1] Argamon, S., Koppel, M., Fine, J., Shimoni, A.R.: Gender, genre, and writing style in formal written texts. TEXT 23, 321–346 (2003) |
Date | |
Source | Own work |
Author | Siobhán Grayson |
Licensing
[edit]- You are free:
- to share – to copy, distribute and transmit the work
- to remix – to adapt the work
- Under the following conditions:
- attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- share alike – If you remix, transform, or build upon the material, you must distribute your contributions under the same or compatible license as the original.
This image was uploaded as part of Wiki Science Competition 2017. |
File history
Click on a date/time to view the file as it appeared at that time.
Date/Time | Thumbnail | Dimensions | User | Comment | |
---|---|---|---|---|---|
current | 14:30, 30 November 2017 | 1,658 × 1,125 (963 KB) | Siobhangrayson (talk | contribs) | User created page with UploadWizard |
You cannot overwrite this file.
File usage on Commons
The following page uses this file:
Metadata
This file contains additional information such as Exif metadata which may have been added by the digital camera, scanner, or software program used to create or digitize it. If the file has been modified from its original state, some details such as the timestamp may not fully reflect those of the original file. The timestamp is only as accurate as the clock in the camera, and it may be completely wrong.
Conversion program | Mac OS X 10.10.5 Quartz PDFContext |
---|---|
Encrypted | no |
Page size | 796 x 540 pts |
Version of PDF format | 1.3 |