*** Welcome to piglix ***

Pangrammatic window


A pangrammatic window is a stretch of naturally occurring text that contains all the letters in the alphabet.

The shortest known naturally occurring pangrammatic window was discovered in October 2014 through an automated processing of Google's indexed webcorpus, found in a review of the movie Magnolia written by Todd Manlow on the website PopMatters: "Further, fractal geometries are replicated on a human level in the production of certain “types” of subjectivity: for example, aging kid quiz show whiz Donnie Smith (William H. Macy) and up and coming kid quiz show whiz Stanley Spector (Jeremy Blackman) are connected (or, perhaps, being cloned) in ways they couldn’t possibly imagine." at 36 letters.

The shortest known window in a published work is found in Piers Anthony's book Cube Route: "We are all from Xanth," Cube said quickly. "Just visiting Phaze. We just want to find the dragon." at 42 letters occurring on page 98 of the 2004 First Mass Market Edition published by Tor.

Prior to that, the shortest known window in a published work was found in Lillie de Hegermann-Lindencrone's 1912 book In the Courts of Memory: "I sang, and thought I sang very well; but he just looked up into my face with a very quizzical expression, and said, 'How long have you been singing, Mademoiselle?'" at 56 letters. A 47-letter example has, however, been found on the internet: "JoBlo's movie review of The Yards: Mark Wahlberg, Joaquin Phoenix, Charlize Theron...".

Generally, according to the law of probability, the shorter the work, the longer the minimal pangrammatic window (if any) will be.

Some estimates can be made using the frequencies of the letters. Assuming that the incidence of each letter in a passage is independent of the incidence of the other letters, and provided m is much larger than 26, the probability that a sequence of length m will contain all 26 letters is approximately P(a)P(b)...P(y)P(z), where P(letter) = 1 − (1 − p(letter))m and p(letter) is the frequency of the letter as a fraction (e.g. 5% is 0.05). Inputting the letter frequencies for the English language, the probability that a 1,700-letter sequence will contain all 26 letters is about 50%. At 1,000, there is about a 19.5% chance, and at 2,500, there is about a 73% chance.


...
Wikipedia

...