1 Supplementary Material for: The generation of thermostable fungal laccase chimeras by SCHEMA-RASPP structure-guided recombination in vivo Ivan Mateljak a, Austin Rice b, Kevin Yang b, Thierry Tron c and Miguel Alcalde a* a Department of Biocatalysis, Institute of Catalysis, CSIC, Cantoblanco 2849, Madrid, Spain. b Division of Chemistry and Chemical Engineering, California Institute of Technology, CALTECH, Pasadena CA 91125-4, USA. c Aix Marseille Université, Centrale Marseille, CNRS, ism2 UMR 7313 13397 Marseille, France. * Address correspondence to: Miguel Alcalde, malcalde@icp.csic.es. Supplementary material contains 7 supplementary figures, two supplementary tables and supplementary methods (two additional SCHEMA-RASPP scripts and details of PCR reactions).
2 Figure S1. ABTS activity of parental laccases fused to 3 different signal peptides in microplate format. (A) OB-1; (B) Lac3; (C) 3PO. Individual clones of each laccase construct were cultured in sterile 96-well plates containing 2 μl of SEM. After fermentation, the plates were centrifuged and 2 μl of supernatant was transferred to new 96-well plates. The activity of each construct was measured by adding 18 µl of mm citrate phosphate buffer ph 4. containing 3 mm ABTS. Each point and standard deviation is taken from three independent measurements. Figure S2. Multiple sequence alignment of the parental types. Conserved residues are highlighted in grey: Block 1, red; Block 2, cyan; Block 3, purple; and Block 4, green. The crossover locations are highlighted with black arrows (with the first residue of a block in bold) and the regions used to attach the blocks by homologous recombination are underlined. Residues of parental sequences (Lac3 and 3PO) modified to fit to OB-1 sequence for in vivo recombination are highlighted as black circles. Figure S3. Cloning strategy for efficient transformation in S. cerevisiae. Each primer is depicted as a black arrow, with the 4 bp overhangs based on the OB-1 sequence highlighted as blue wavy lines. Crossover positions are represented as yellow rectangles. Figure S4. Distribution of SCHEMA blocks in the thermostable chimeras. OB-1, grey; Lac3, red; 3PO, black. Figure S5. C-terminus of AAA B C chimera represented as a cartoon. OB-1, grey; Lac3, red; 3PO, black. T1Cu is depicted as a blue sphere and the mutated residues are represented as sticks. Figure S6. Alignment of the parental types and the catalytic pocket residues of all the chimeras. Conserved residues are highlighted in yellow.
3 Figure S7. ph stability of the parental types, and the 5 most active and stable chimeras. Laccase variants (A) AAAA (OB-1), (B) BBBB (Lac3), (C) CCCC (3PO), (D) AAA B C, (E) CBBC, (F) ABBC (G) B C BBC and (H) BBBC were incubated for, 4, 24, 48, 72 and 96 h at different ph values (2-9). The residual activity after each incubation was measured with 3 mm ABTS in mm sodium phosphate/citrate buffer ph 4.. Laccase activity was normalized to the highest activity value at time. Each point and standard deviation is from three independent measurements.
Figure S1 A 25 B 6 2 5 ABTS activityu/l ABTS activity (U/L) 15 ABTS activityu/l ABTS activity (U/L) 4 3 2 5 C 35 α PcL OB-1 α PM1 OB-1 α native OB-1 α PcL Lac3 α PM1 Lac3 α native Lac3 3 ABTS activityu/l ABTS activity (U/L) 25 2 15 1 5 α PcL 3PO α PM1 3PO α native 3PO
Figure S2 OB-1 SIGPVADLTISNGAVSPDGFSRQAILVNDVFPSPLITGNKGDRFQLNVIDNMTNHTMLKS 6 Lac3 AIGPVTDLTISDADISPDGFTRAAVVMNDQFPGPLIAGNKGDNFQINVIDNLSNSTMLTS 6 3PO AIGPVADLTLTNAQVSPDGFAREAVVVNGITPAPLITGNKGDRFQLNVIDQLTNHTMLKT 6 OB-1 TSIHWHGFFQHGTNWADGPAFVNQCPISTGHAFLYDFQVPDQAGTFWYHSHLSTQYCDGL 12 Lac3 TTIHWHGFFQKGTNWADGAAFVNQCPISAGNSFLYDFTATDQAGTFWYHSHLSTQYCDGL 12 3PO SSIHWHGFFQQGTNWADGPAFVNQCPIASGHSFLYDFQVPDQAGTFWYHSHLSTQYCDGL 12 OB-1 RGPIVVYDPQDPHKSLYDVDDDSTVITLADWYHLAAKVGPAAPT-ADATLINGLGRSIN- 178 Lac3 RGPMVVYDPDDPHASLYDVDDDSTVITLSDWYHTAARLGARFPAGADSTLINGLGRAAGG 18 3PO RGPFVVYDPNDPHASLYDIDNDDTVITLADWYHVAAKLGPRFPFGSDSTLINGLGRTTG- 179 OB-1 TLNADLAVITVTKGKRYRFRLVSLSCDPNYTFSIDGHSLTVIEADGVNLKPQTVDSIQIF 238 Lac3 DADAALAVFNVTQGSRYRFRLVSLSCDPNFNFTIQDHNMTIIEVDGVNVEPVTVDSIQIF 24 3PO IAPSDLAVIKVTQGKRYRFRLVSLSCDPSHTFSIDNHTMTIIEADSINTQPLEVDSIQIF 239 OB-1 PAQRYSFVLNADQDVDNYWIRALPNSGTRNFDGGVNSAILRYEGAAPVEPTTTQTPSTQP 298 Lac3 AGQRYSFVLTADQDIGNYWIQAVPNTGTVTTDGGVNSAILRYDTADPIEPDAADPTSSIP 3 3PO AAQRYSFVLDASQPVDNYWIRANPAFGNTGFAGGINSAILHYDGAPEIEPTSVQTTPTKP 299 OB-1 LVESALTTLEGTAAPGNPTPGGVDLALNMAFGFAGGRFTINGASFTPPTVPVLLQILSGA 358 Lac3 LVETDLVPLENLAAPGDPTVGGVDLAMNLEFDFNGTWFFINGEPFVPPSVPVLLQIMSGA 36 3PO LNEVDLHPLSPMPVPGSPEPGGVDKPLNLVFDFNGTNFFINNHTFVPPSVPVLLQILSGA 359 OB-1 QSAQDLLPSGSVYSLPANADIEISLPAT-SAAPGFPHPFHLHGHTFAVVRSAGSSTYNYA 417 Lac3 QSAADLLPSGSVYTLPANSTIEISFPMNTTAAPGAPHPFHLHGHTFYVVRSAGSTEYNYV 42 3PO QAAQDLVPEGSVFVLPSNSSIEISFPAT-ANAPGFHHPFHLHGHAFAVVRSAGSSVYNYD 418 OB-1 NPVYRDVVNTGSPGDNVTIRFRTDNPGPWFLHCHIDFHLEAGFTVVMAEDIPDVAATNPV 477 Lac3 NPPQRDTVSTGTDGDNVTIRFTTNNPGPWFLHCHIDFHLDAGFAIVLSEDTPDAASANTP 48 3PO NPIFRDVVSTGQPGDNVTIRFETNNPGPWFLHCHIDFHLDAGFAVVMAEDTPDTKAANPV 478 OB-1 PQAWSDLCPTYDALSPDDQ-- 496 Lac3 SSAWDDLCPTYNTDYPDGLGR 51 3PO PQAWSDLCPIYDALDPSDL-- 497
Figure S3 OB1 F1 F2 F3 F4 168 31 442 Signal peptide R1 R2 R3 R4 3PO F1 F2 F3 F4 4bp 2bp 4bp 2bp 4bp 2bp 168 31 442 Signal peptide 2bp 4bp 2bp 4bp 2bp 4bp R1 R2 R3 R4 Lac3 F1 Signal peptide F2 F3 F4 4bp 2bp 4bp 2bp 4bp 2bp 168 31 442 2bp 4bp 2bp 4bp 2bp 4bp R1 R2 R3 R4
Figure S4 8 6 % 4 2 1 2 3 4 Blocks
Figure S5 S492D A472K I468T D494S T487I V471T T474A Q496L T461A E457D R439T T1 P43D D441N V424T N426S S429T
Figure S6 FRAGMENT 1 FRAGMENT 2 FRAGMENT 3 FRAGMENT 4 AAAA A P T 164 25 D 263 N S 264 331 F F 336 39 P G F 392 455 H A 458 BBBB F P F 164 26 D 264 A F 265 332 F F 337 391 P G F 393 456 H A 459 Parents CCCC F P A 164 27 D 265 N T 266 333 F F 338 393 P G A 395 458 H A 461 Chimeras CABC F P F 164 26 D 264 N S 265 332 F F 337 392 P G A 394 457 H A 46 CCBC F P F 164 26 D 264 A F 265 332 F F 337 392 P G A 394 457 H A 46 ACAC A P T 164 25 D 263 A F 264 331 F F 336 39 P G F 392 455 H A 458 ACAA A P T 164 25 D 263 A F 264 331 F F 336 39 P G F 392 455 H A 458 AAA B C A P T 164 25 D 263 N S 264 331 F F 336 39 P G F 392 455 H A 458 B C BBC F P F 164 27 D 265 N T 266 333 F F 338 393 P G A 395 458 H A 461 BBBC F P A 164 27 D 265 N T 266 333 F F 338 393 P G A 395 458 H A 461 AAAC A P T 164 25 D 263 N S 264 331 F F 336 39 P G F 392 455 H A 458 A C CBC A P T 164 26 D 264 A F 265 332 F F 337 392 P G A 394 457 H A 46 CBBC F P F 164 27 D 265 N T 266 333 F F 337 393 P G A 395 458 H A 461 CAA C A F P F 164 26 D 264 N S 265 332 F F 337 391 P G F 393 456 H A 459 ABAC A P T 164 26 D 264 N T 265 332 F F 337 391 P G F 393 456 H A 459 ABAA A P T 164 26 D 264 N T 265 332 F F 337 391 P G F 393 456 H A 459 ABBC A P T 164 26 D 264 N T 265 332 F F 337 392 P G A 394 457 H A 46
Figure S7 A 14 B 14 C 14 Residual residual activity (%) (%) 12 8 6 4 2 Residual activity (%) residual activity (%) 12 8 6 4 2 Residual residual activity (%) (%) 12 8 6 4 2 D 14 2 3 4 5 6 7 8 9 ph E 14 2 3 4 5 6 7 8 9 ph F 14 2 3 4 5 6 7 8 9 p H ph ABBC Residual residual activity activity (%) (%) 12 8 6 4 2 Residual residual activity (%) (%) 12 8 6 4 2 Residual residual activity (%) (%) 12 8 6 4 2 G 14 2 3 4 5 6 7 8 9 ph H 14 2 3 4 5 6 7 8 9 ph 2 3 4 5 6 7 8 9 ph Residual residual activity (%) (%) 12 8 6 4 2 Residual residual activity (%) (%) 12 8 6 4 2 h 4 h 24 h 48 h 72 h 96 h 2 3 4 5 6 7 8 9 ph 2 3 4 5 6 7 8 9 ph
Table S1. Characteristics of SCHEMA-RASSP libraries Library description number Crossover points E m (Numbering based on mature PM1 aka OB1) of blocks Identity scores underneath 4 8.1111 58.1481 168 31 442 4 1 5 5 11.1111 63.1111 124 21 31 442 6 3 1 5 6 14.1111 65.9863 124 29 31 373 442 6 4 1 5 5 7 12.4444 65.368 43 124 166 29 31 442 5 6 3 4 1 5
Table S2. Oligonucleotides used in this study BLOCK 1 OB-1 Lac3 3PO RMLN for: 5 -CCTCTATACTTTAACGTCAAGG-3 Oligonucleotides OB1-168 rev: 5 -GCGACCGAGGCCGTTGATAAGAGT-3 RMLN for: 5 -CCTCTATACTTTAACGTCAAGG-3 Lac3-168 rev: 5 -GCGACCGAGGCCGTTGATAAGAGTAGCATCGGCCCCAGCAGGGAAACGAGCGCC-3 RMLN for: 5 -CCTCTATACTTTAACGTCAAGG-3 3PO-168 rev: 5 -GCGACCGAGGCCGTTGATAAGAGTAGCATCGGCGCCAAATGGGAAGCGAGGTCC-3 BLOCK 2 OB-1 Lac3 3PO Oligonucleotides OB1-168 for: 5 -GCCGATGCTACTCTTATCAACGGCCTCGGTCGC-3 OB1-31 rev: 5 -GTCGACACCGCCAGGGGTCGGGTTGCCGGGCGCAGCGGT-3 Lac3-168 for: 5 -GCCGATGCTACTCTTATCAACGGCCTCGGTCGCGCGGCAGGCGGCGATGCGGACGCCG-3 Lac3-31 rev: 5 -GTCGACACCGCCAGGGGTCGGGTTGCCGGGCGCAGCGGTGTTCTCCAGCGGGACGAGGTC-3 3PO-168 for: 5 -GCCGATGCTACTCTTATCAACGGCCTCGGTCGCACCACTGGCATAGCACCGTCCGACT-3 3PO-31 rev: 5 -GTCGACACCGCCAGGGGTCGGGTTGCCGGGCGCAGCGGTAGGCGAGAGAGGATGCAAGTC-3 BLOCK 3 OB-1 Lac3 3PO Oligonucleotides OB1-31 for: 5 -ACCGCTGCGCCCGGCAACCCGACCCCTGGCGGTGTCGAC-3 OB1-442 rev: 5 -GGAAGTCGATGTGGCAGTGGAGGAACCACGGGCCGGGGTT-3 Lac3-31 for: 5 -ACCGCTGCGCCCGGCAACCCGACCCCTGGCGGTGTCGACCTGGCGATGAACCTGGAGTTC-3 Lac3-442 rev: 5 -GGAAGTCGATGTGGCAGTGGAGGAACCACGGGCCGGGGTTGTTCGTCGTGAACC-3 3PO-31 for: 5 -ACCGCTGCGCCCGGCAACCCGACCCCTGGCGGTGTCGACAAGCCTCTGAACTTGGTCTTC-3
BLOCK 4 3PO-442 rev: 5 -GGAAGTCGATGTGGCAGTGGAGGAACCACGGGCCGGGGTTATTGGTCTCGAAGCG-3 Oligonucleotides OB-1 Lac3 3PO OB1-442 for: 5 -AACCCCGGCCCGTGGTTCCTCCACTGCCACATCGACTTC-3 RMLC rev: 5 -GCTTACATTCACGCCCTCCC-3 Lac3-442 for: 5 -AACCCCGGCCCGTGGTTCCTCCACTGCCACATCGACTTT-3 RMLC rev: 5 -GCTTACATTCACGCCCTCCC-3 3PO-442 for: 5 -AACCCAGGCCCGTGGTTCCTCCACTGCCACATTGACTTC-3' RMLC rev: 5 -GCTTACATTCACGCCCTCCC-3
14 SUPPLEMENTARY MATERIAL AND METHODS SCHEMA-RASSP Scripts for selecting crossover points find_homology.py Reads a rasppcurve.py output file and rewrites it to include homology scores for the crossover points ''' import find_homology as fh import os, argparse, sys def main(): dir = os.path.dirname( file ) parser = argparse.argumentparser() parser.add_argument('-i','--in_file',required=true) parser.add_argument('-o','--out_file',required=false) parser.add_argument('-a','--align_file',required=true) parser.add_argument('-w', '--window',required=true, type=int) args = parser.parse_args() # open the input and output files try: f_in = open(os.path.join(dir,args.in_file),'r') except: sys.exit ('Input file does not exist') if args.out_file == None: f_out = open (os.path.join(dir,args.in_file.split('.')[]+'_scored.txt'),'w') elif args.out_file == 'stdout': f_out = sys.stdout else: f_out = open (os.path.join(dir,args.out_file),'w') # make the scorelist a_file = os.path.join (dir, args.align_file)
15 window = (args.window-1)/2 score_list = fh.score_homology_from_file (a_file, window) # read the input file inp = f_in.read().splitlines() # figure out what line the first chimera is on (finds first line that does # not begin with '#' for i,line in enumerate(inp): if line[]!= '#': first_ch_line = i break # write the header lines to the output file f_out.write ('SCHEMA output edited to include homology scores\n') f_out.write ('The window for homology is %d\n' %args.window) for i in range(first_ch_line-1): f_out.write (inp[i] + '\n') n_xo = len(line.replace('\t',' ').split(' ',))-2 spaces= ''.join([' ' for n in range(n_xo*4-17)]) f_out.write (inp[first_ch_line-1]+spaces+'min mean\n') # for each of the following lines, get the crossover points and the scores # then write the line and the new data for line in inp[first_ch_line:-1]: xo_locs = [int(p) for p in line.replace('\t',' ').split(' ',)[2:-1]] xo_scores = fh.score_locations(xo_locs, score_list, zeroed=false) f_out.write(line+'\n') f_out.write('homology scores: ') f_out.write(''.join(['%d ' %l for l in xo_scores['scores']])) f_out.write(' %d %.1f\n' %(xo_scores['min'], xo_scores['mean'])+'\n') # close input and output files f_in.close() f_out.close()
16 if name == " main ": main() add_scores.py Reads a rasppcurve.py output file and rewrites it to include homology scores for the crossover points ''' import find_homology as fh import os, argparse, sys def main(): dir = os.path.dirname( file ) parser = argparse.argumentparser() parser.add_argument('-i','--in_file',required=true) parser.add_argument('-o','--out_file',required=false) parser.add_argument('-a','--align_file',required=true) parser.add_argument('-w', '--window',required=true, type=int) args = parser.parse_args() # open the input and output files try: f_in = open(os.path.join(dir,args.in_file),'r') except: sys.exit ('Input file does not exist') if args.out_file == None: f_out = open (os.path.join(dir,args.in_file.split('.')[]+'_scored.txt'),'w') elif args.out_file == 'stdout': f_out = sys.stdout else: f_out = open (os.path.join(dir,args.out_file),'w') # make the scorelist a_file = os.path.join (dir, args.align_file) window = (args.window-1)/2
17 score_list = fh.score_homology_from_file (a_file, window) # read the input file inp = f_in.read().splitlines() # figure out what line the first chimera is on (finds first line that does # not begin with '#' for i,line in enumerate(inp): if line[]!= '#': first_ch_line = i break # write the header lines to the output file f_out.write ('SCHEMA output edited to include homology scores\n') f_out.write ('The window for homology is %d\n' %args.window) for i in range(first_ch_line-1): f_out.write (inp[i] + '\n') n_xo = len(line.replace('\t',' ').split(' ',))-2 spaces= ''.join([' ' for n in range(n_xo*4-17)]) f_out.write (inp[first_ch_line-1]+spaces+'min mean\n') # for each of the following lines, get the crossover points and the scores # then write the line and the new data for line in inp[first_ch_line:-1]: xo_locs = [int(p) for p in line.replace('\t',' ').split(' ',)[2:-1]] xo_scores = fh.score_locations(xo_locs, score_list, zeroed=false) f_out.write(line+'\n') f_out.write('homology scores: ') f_out.write(''.join(['%d ' %l for l in xo_scores['scores']])) f_out.write(' %d %.1f\n' %(xo_scores['min'], xo_scores['mean'])+'\n') # close input and output files f_in.close() f_out.close()
18 if name == " main ": main() Details of PCR reactions In the first PCR reaction (template = OB-1) RMLN was used as forward primer and OB1-168 rev as reverse primer. In the second PCR reaction (template = Lac3) RMLN was used as forward primer and Lac3-168 rev as reverse primer. In the third PCR reaction (template = 3PO) RMLN was used as forward primer and 3PO-168 rev as reverse primer. Block 2 was amplified in 3 independent PCR reactions (1 PCR for each parental type). In the first PCR reaction (template = OB-1) OB1-168 for was used as forward primer and OB1-31 rev as reverse primer. In the second PCR reaction (template = Lac3) Lac3-168 for was used as forward primer and Lac3-31 rev as reverse primer. In the third PCR reaction (template = 3PO) 3PO-168 for was used as forward primer and 3PO-31 rev as reverse primer. Block 3 was amplified in 3 independent PCR reactions (1 PCR for each parental type). In the first PCR reaction (template = OB-1) OB1-31 for was used as forward primer and OB1-442 rev as reverse primer. In the second PCR reaction (template = Lac3) Lac3-31 for was used as forward primer and Lac3-442 rev as reverse primer. In the third PCR reaction (template = 3PO) 3PO-31 for was used as forward primer and 3PO-442 rev as reverse primer. Block 4 was amplified in 3 independent PCR reactions (1 PCR for each parental type). In the first PCR reaction (template = OB-1) OB1-442 for was used as forward primer and RMLC as reverse primer. In the second PCR reaction (template = Lac3) Lac3-442 for was used as forward primer and RMLC as reverse primer. In the third PCR reaction (template = 3PO) 3PO-442 for was used as forward primer and RMLN as reverse primer.