Monday, April 6, 2020

Distinctive vocabulary of each of the four Gospels

A reader has provided me with a spreadsheet of all the words in the King James Version of the New Testament, with the number of occurrences of each word in each book. Using this data, I have put together these lists of the most distinctive words in each Gospel.

To be included in the list for a given Gospel, a word had to (1) occur at least 10 times in that Gospel and (2) have a frequency (per 1000 words) at least twice as high as its frequency in any of the other three Gospels.

  • Matthew: begat, deliver, end, field, gather, heaven, hypocrites, multitudes, O, prison, reward, righteous, talents, think, wherefore, wise
  • Mark: asked, began, hands, immediately, James, looked, ship, straightway, such, themselves, unclean
  • Luke: angel, certain, company, named, pass, returned, rich, sight, ten, years, Zacharias
  • John: abide, am, answered, bear, because, beginning, believe, believed, believeth, born, die, disciple, doeth, ever, except, Father, feast, flesh, glorified, greater, hast, honour, I, Jews, judge, keep, knew, know, known, Lazarus, life, lifted, light, love, loved, me, might, mine, myself, now, Philip, record, scripture, seek, seen, seeth, sent, sheep, Simon, sin, true, truth, verily, water, whence, whither, whom, witness, words, work, world, yet

There's obviously a lot of noise here, but one can still see the familiar areas of emphasis of each Gospel. Matthew is the most moralistic (righteous, hypocrites, reward, prison). Mark is the fastest-paced (immediately, straightway) and has a special focus on exorcism (unclean spirits). Nothing in particular stands out about Luke. John is, of course, the most distinctive; aside from the familiar themes (believe, know, life, light, love, truth), we can see that it is the Gospel in which Jesus talks the most about himself (I, me, mine, myself, am). It is interesting to note that sin (singular) occurs almost exclusively in John, while sins and sinners are more typical of the Synoptics. This is consistent with the idea (see here) that John means something else by sin than "moral transgression."


It can also be interesting to look at the words that meet the converse requirements: (1) occurring at least 10 times in each of the other three Gospels and (2) being no more than half as frequent in the target gospel as in any of the other three. No words meet these requirements for the Gospel of Matthew, but here are the lists for the other three.

  • Mark: hast, law, light, nor, prophet, then
  • Luke: answered, disciples, king, saith, sea, verily
  • John: against, began, children, devils, enter, fell, great, house, kingdom, multitude, or, over, pass, scribes, their, upon, wife

We can see here how John differs from the Synoptics in hardly mentioning the kingdom of God, which (with its Matthean variant kingdom of heaven) seems in the Synoptics to be the center of Jesus' whole message. The complete absence of devils (plural; the devil is mentioned about equally in all the Gospels) reflects the fact that there are, uniquely, no exorcisms in the Fourth Gospel.

1 comment:

Bruce Charlton said...

This demonstrates that vocabulary can be used as a proxy for content.

Happy 85th birthday, Jerry Pinkney

Poking around a used bookstore this afternoon, I felt a magnetic pull to a particular book, which, when I took it down from the shelf, turne...