I have a long list of references (citations). I would like to parse each reference (a string) to separate out the author(s), year, title of the article, and source (URL). Since there are over 6500 references, I would like to do this programmatically. My formulas seem to be working well (please see the attached sample). However, I cannot determine why my formula for "Year" (Column D) is inconsistent. It works correctly through Row 5, pulling out the year where the citation contains one and returning "N/A" where it doesn't. But in Row 6, I get "N/A" when I should get 1984. The odd part is that if I use only the MID part of the formula, I get the correct result (as can be seen in Cell D13). So why does the formula fail in Cell D6 but work in the other cells of that column?

Also, I obtained the formula for the position of the first number (Column B) from the internet. It works for my purpose, but I don't understand the part of the formula from the ampersand (&) onward. Why is & used, and why are all the numbers repeated?
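
To make the intended behavior concrete, here is a rough Python sketch of what I want the "position of the first number" and "Year" columns to compute. This is not my Excel formula, only an illustration of the expected results. The function names and sample strings are made up, and the year pattern assumes a standalone four-digit year starting with 1 or 2.

```python
import re


def first_digit_position(text: str) -> int:
    """Return the 1-based position of the first digit, or 0 if there is none."""
    match = re.search(r"\d", text)
    return match.start() + 1 if match else 0


def extract_year(text: str) -> str:
    """Return the first standalone 4-digit year as text, or "N/A" if none is found.

    Assumption: a year is four digits beginning with 1 or 2.
    """
    match = re.search(r"\b[12]\d{3}\b", text)
    return match.group(0) if match else "N/A"


if __name__ == "__main__":
    # Hypothetical sample citations, not taken from my actual list.
    samples = [
        "Smith, J. (1984). A title. http://example.com",
        "Anonymous. Untitled report.",
    ]
    for citation in samples:
        print(first_digit_position(citation), extract_year(citation))
```

A citation with a year should give that year, and one without any year should give "N/A", which is what I expect from Column D in every row.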

Any help will be appreciated.

Thanks,

Serge

