Converting plain text citations in Word into EndNote unformatted EndNote citations

Situation: you have a Word document that has no ‘live’ EndNote references in it – they were either typed manually or if EndNote or other referencing software was used, then the live references have been converted to plain text at some point and you do not have an earlier version of the document where they’re live.

This might also be useful if you’ve got a document with live references in from different referencing software that doesn’t have ‘unformatted citations’.

You have an EndNote library of the references that are used in the document (or, perhaps, you don’t yet, but you would like to get this too – see note at the end); you just want to link the EndNote library to the citations in the document.

I.e.: you have a Word document with plain text references and you want to make them live EndNote references.

HOW DO YOU DO IT?

First, if you have numbered citations, you don’t really have much to go on. You could probably do a Find and Replace to change them all to a general {Author, 1000} format, but literally that (i.e. not specific authors and years) – running the ‘Update Citations and Bibliography’ function would identify them as points where a citation is required but you’d need to then manually add the details of the reference to search and select the correct reference to add.

However, if you have some kind of author-date citations, you can convert (Smith, 2024) into {Smith, 2024} – meaning that the process will be at least partly automated.

If round brackets are not used at all in the rest of the document, other than in citations (this is unlikely unfortunately!), then simply use Find and Replace (ctrl-H) to replace ( with { then ) with }.

However, otherwise, some more complex Find & Replace functions may be necessary. In these cases, ensure ‘Use wildcards’ (in the Find and Replace box) is selected.

Find:

\((*, [0-9]{4})\)

Replace with:

{\1}

Is probably the simplest form, assuming a (Smith, 2024) format (i.e single author surname then comma, space, year – no initials or et al or other authors).

More complex:

Find:

\(([!0-9]@, )([!0-9]@, )([0-9]{4})\)

Replace with:

{\1\3}

Will convert (Smith, J., 2024) or (Smith, et al, 2024)  or (Smith, J. et al, 2024) to  {Smith, 2024}

Find:

\(([!0-9]@)( and [!0-9]@)(, [0-9]{4})\)

Replace with

{\1\3}

Will convert (Smith and Jones, 2023) to {Smith, 2023} (though you’d need to watch out for institutions with the word ‘and’ in their name)

I would always do ‘Replace’ one at a time, rather than ‘Replace All’ – those functions can end up doing weird things (due to the capacity to have flexible author strings with any number of characters), so it feels safer to check it hasn’t somehow selected a whole paragraph to replace. If you do have a mix of reference formats, I guess I’d maybe work through these in reverse order they’re listed here (ands first, then initials, et als, etc, then just all the simple one author names remaining) and I think that may work. You may need to tweak if you have other formats!

Once all the references are in the format {Smith, 2024}, hopefully, if you’re confident your EndNote library has the right references, then it should be relatively straightforward (though it will still involve some manual selecting) to ‘Update Citations and Bibliography’ and get everything linked up and functional.

IF YOU DON’T HAVE THE REFERENCES IN ENDNOTE: this is rather more work, but it is possible to automate this to some extent, if you’ve got a lot of journal references. The exact sequence of processing will depend on the reference format, but you’re aiming to convert e.g.

Smith, A., Jones, B. (2024) ‘Article about interesting things’. Journal of interesting things 123(45):67-68.

Into something like:

(smith*.au AND “article about interesting things”.ti)

That you can then compile into a long string and run as searches on a database.

It *is* possible, just arduous, so only worth it for large numbers (for middling numbers, just manually copying first author and titles (often minus punctuation is safer) into two columns in Excel. Then use an Excel function to put them all into the right format to search your chosen database (you may do this several different ways to do several different databases), copy that column into word and run and Find and Replace for a paragraph mark (which is ^p) to be replaced by ‘ AND ‘ and then you’ll have your search string.

ALTERNATIVELY, depending on format, it’s sometimes possible to actually set rules to split references up (Find and Replace in Word, looking for punctuation usually, replacing with tab-spaces) so that ALL the information can be pasted into Excel – then you can use the process to add references in Excel via a tab-delimited format into EndNote. This can be tough, but if your references are all of the same type and punctuated in distinctive ways, it is possible!

Fixing broken/corrupted EndNote citations

Uh oh! EndNote citations that look active, but some have stopped responding to EndNote at all? Won’t format into a new style nor appear in the references and won’t convert into unformatted citations? But if you view field codes, they look okay? Mysterious and frustrating?

I think this is caused by editing on other word processors, but I’m also suspicious of Track Changes and the copy & pasting of formatted citations.

Solution? There doesn’t seem to be any way to fix these broken citations directly. Best straightforward(ish) option: unformat citations, remove field codes from the document (turning the broken ones into plain text), then go through and reinsert them from EndNote manually.

If you’d been using an author-date style, you can automate this slightly by changing EndNote’s temporary citation delimiters to round brackets, meaning EndNote will go through and pick up on all the defunct citations. However, it’ll also pick up on anything else in a bracket, plus it won’t match the citations directly with the EndNote library (et als, no record numbers), so you’ll need to select & insert each citation.

Not ideal! What if you’ve got loads of these corrupted citations? And you’ve got loads of other stuff in brackets and/or you’ve used a numbered style?

THERE IS SOMETHING THAT CAN BE DONE.

This is it:

Convert all still-functional citations to unformatted citations.

Press Alt+F9 (display field codes in the document) and you should see the broken citations as field codes, including a load of data about the reference – this is what can be used.

Press Alt+F9 to switch back. Unfortunately getting the actual text of the field codes is not straightforward. But someone has made something that will do it:

http://www.gmayor.com/export_field.htm

(I can’t guarantee that this isn’t some kind of cunning virus thing, but I’m fairly confident that that’s not the case.)

Once you’ve downloaded it, then installed it, you can access it from the ‘Developer’ tab in Word.

Go through the document, highlighting it and running the converter in chunks (I think it can only process a certain amount of text at a time). (Also: avoiding headings and other non-standard text, as it’ll clear the formatting).

So, your broken EndNote citations will now be weird long field code text. But you can modify them (with Word’s Find/Replace function – Ctrl+H) so that EndNote thinks they’re unformatted citations!

Semi-colons in multiple citations are a hurdle and a few of these Find/Replaces are for dealing with them. The others are designed to clean up at least the start of each field code so EndNote will pick them up.

Find/Replace these, in this order (if ‘?’ is used, activate wildcards for that search, otherwise don’t)

19??; WITH ~CHECK DATE~

20??; WITH ~CHECK DATE~

</Cite><Cite><Author> WITH }{

ADDIN EN.CITE <EndNote><Cite><Author> WITH NOTHING

</Author><Year> WITH COMMA SPACE

</Year><RecNum> WITH SPACE HASH

</RecNum> WITH @@

&???;  WITH NOTHING

&apos; WITH NOTHING

{ADDIN EN.CITE.DATA} WITH ~BROKEN CITATION~

 

(That last one is for totally unrecoverable ones that don’t have full field code data – they’ll need to be searched for later and reinserted manually.)

Then ‘Update Citations & Bibliography’ and cross your fingers.