Summary: | The recent development of High Throughput Sequencing technologies has enabled an individual's TCR repertoire to be efficiently analysed at the nucleotide level. However, with unique clonotypes ranging in the tens of millions per individual, this approach gives a surfeit of information that is difficult to analyse and interpret in a biological context and gives little information about TCR structural diversity. Using publicly available TCR CDR3 sequence data, we analysed TCR repertoires by converting the encoded CDR3 amino acid sequences into Kidera Factors, a set of orthogonal physico-chemical properties that reflect protein structure. This approach enabled the TCR repertoire from different individuals to be distinguished and demonstrated the close similarity of the repertoire in different samples from the same individual.
|