Semantic resources project/MouseModels/Missing strains

Often a strain name shows up as a part of the name of another strain, e.g. as the host, donor, or "helper" strain involved in the creation of a congenic strain. For example, in NOD.CZI-Idd3/MrkTacJ, NOD (the host) and CZI (the donor) are both strain names.

Sometimes these auxiliary strain names are abbreviations; this is documented in the strain nomenclature guide.

The question is, do all of these auxiliary strain names show up in the strain report? The answer is no; about 157 are missing. In the example, NOD is in the strain list, but CZI is not.

In some cases I've failed to find a match because my parser isn't sophisticated enough, or because the lab code gets dropped when the name is incorporated (e.g. FUBI/Rbrc shortened to FUBI), but many of these names remain unresolvable even after a manual search through the strain report.

Query: prefix rdfs:  prefix owl:  prefix :  select distinct ?nn from  where {   {      { ?strain :hasHost ?s. }     union { ?strain :hasDonor ?s. }     union { ?strain :hasBackground ?s. }     union { ?strain :hasHelper ?s. }   }      ?s :hasStrainName ?nn. optional { ?s :hasMgiId ?id. }   filter(!(bound(?id))). } order by ?nn

Result (with some manual annotations, non-comprehensive): 101H    -- 101/H 101Rl   -- 101/Rl 129P 129P2 129P4 129PS 129S 129Sv 129T2 129X 129X1 129X1Bom 129X1Ms  -- perhaps 129X1/SvJMs ? 129XB 5R AKRMs  -- probably AKR/Ms B10Gn B10Rl   -- probably C57BL/10Rl B10ScSn   -- C57BL/10ScSn B10Sn    -- C57BL/10Sn B10SnJ   -- C57BL/10SnJ B10SnSlc  -- C57BL/10SnSlc B6129 B6Bm B6By B6ByJ B6C3Fe B6CB B6CBA B6ChR B6Cr B6CrSlc B6D2 B6Ei B6EiC3 B6EiC3Sn B6J B6JEi    -- possibly C57BL/6JEiJ ? B6JHsd  -- C57BL/6Hsd  ??? B6JJcl B6N B6NCrj B6NCrl B6NHsd B6NTac B6Pin B6Rcc B6Ros B6SJL B6Slc B6Smn B6cC3 BALB BKS BKSChpLt BKSW BKa BLiA BP1 BSXB BTBRTFNev BTNTTF BTNTTFArt BXD29 BXD8 By C3A C3Bir C3Fe C3FeJ C3FeLe C3Fg C3Ga C3H101H C3HeJ     -- C3H/HeJ ? C3N C3NJcl C3Ou C3Pas C3Rl C3Sn  -- C3H/Sn ? C3SnSmn C3Wf C3fB6  -- note infix 'f' notation. C3fBi   -- infix 'f' C57BR/CD   -- suspiciously, there is a C57BR/cd C57L/J       -- this strain is in the strain report; bug in query CA CAJcl CAL CAn CAnBomUrd CB6 CBACa   -- there is a CBA/Ca CBACaGnLe CBACaH  -- CBA/CaH CBAGr CBAH CBAJms CBALs CBB6 CBy CByJ CByJcl CBySmn CCi CD1 CD2 CF CF1 CGr CHa CNCr CPB CPt CXB1 CZI CcS3 CnbcLmon:NMRI D D1Lac D2N FUBI  -- probably FUBI/Rbrc FVBHanHsd FVBTac     -- perhaps FVB/NTac ? HA HG INT KB2 KOR1 LIII M MEV MF1 MP1 NODCaj   -- but there is a NOD/Caj NX129 NZM2328 ORNL:STOCK OcB3  -- is this a cross? (infix 'c') P2 PA PROD Peru   -- suspicious, as there is a 'PERU' R1 R2 R201 R203 R205 R209 R5 RIIIS/J       -- false, see above SEACGn SLKh SPRETEi  -- there is a SPRET/Ei TLHO WLC