Skip to main content

Downstream processing of recombinant human insulin and its analogues production from E. coli inclusion bodies

Abstract

The Global Diabetes Compact was launched by the World Health Organization in April 2021 with one of its important goals to increase the accessibility and affordability of life-saving medicine—insulin. The rising prevalence of diabetes worldwide is bound to escalate the demand for recombinant insulin therapeutics, and currently, the majority of recombinant insulin therapeutics are produced from E. coli inclusion bodies. Here, a comprehensive review of downstream processing of recombinant human insulin/analogue production from E. coli inclusion bodies is presented. All the critical aspects of downstream processing, starting from proinsulin recovery from inclusion bodies, inclusion body washing, inclusion body solubilization and oxidative sulfitolysis, cyanogen bromide cleavage, buffer exchange, purification by chromatography, pH precipitation and zinc crystallization methods, proinsulin refolding, enzymatic cleavage, and formulation, are explained in this review. Pertinent examples are summarized and the practical aspects of integrating every procedure into a multimodal purification scheme are critically discussed. In the face of increasing global demand for insulin product, there is a pressing need to develop a more efficient and economical production process. The information presented would be insightful to all the manufacturers and stakeholders for the production of human insulins, insulin analogues or biosimilars, as they strive to make further progresses in therapeutic recombinant insulin development and production.

Introduction

Diabetes is a global epidemic (WHO 2021). In April 2021, the World Health Organisation has launched the Global Diabetes Compact to hasten action to tackle diabetes worldwide. More than 420 million (6% of the world’s population) were living with diabetes before COVID-19 emerged, and it is predicted that this number would surge to 500 million by 2030, and to 700 million by 2045 (WHO 2021). The rise in the number of diabetics worldwide would lead to the consequential increase in the demand for insulin. In addition, there is an unmet demand for affordable insulin, especially in low- and middle-income nations. Many people with type 1 diabetes who are completely reliant on insulin for survival have no access to insulin (WHO 2021). For the 60 million people with type 2 diabetes who need insulin treatment, 1 in 2 of them do not get insulin due to its price (WHO 2021). In 2018, the world human insulin market size was USD 21.26 billion and this number is expected to reach USD 27.71 billion by 2026 (Fortune Business Insights 2020). The human insulin market is forecasted to expand at a compound annual growth rate (CAGR) of 3.4% from 2019 to 2026. In terms of human insulin market dominance, three major companies, Sanofi, Novo Nordisk A/S and Elli Lily and Company, are collectively holding more than 90% share of the market revenue (Fortune Business Insights 2020).

Injections of insulin are important to treat both type 1 and type 2 diabetes, and recombinant human insulin has been shown to have significant advantages over insulins extracted from pork and beef sources. In 1978, scientists at City of Hope first successfully produced recombinant human insulin in partnership with Genentech (Chance and Frank 1993). Eli Lilly and Company launched human insulin of recombinant DNA origin in 1982, while Novo launched it in 1988. Two different production approaches have been reported from Eli Lilly and Company. In the first approach, the A and B chains of insulin were cultured separately in bacteria as inclusion bodies. Then, separate purifications of the two chains were performed and later combined chemically, followed by eventual steps of purification. In the second technique, the expression of proinsulin in Escherichia coli took the form of a tryptophan promotor with a methionine linkage to proinsulin. Cyanogen bromide (CNBr) was used to cleave the linkage, followed by folding of the peptide via correct disulfide bonds formation, and the eventual elimination of the C-peptide via enzymatic action (Frank et al. 1981). It is preferable to adopt the “proinsulin route” as it only requires a single fermentation and purification procedure, which makes the production process more efficient in comparison to the two-chain combination method. Since 1986, the “proinsulin route” has been adopted commercially (Chance et al. 1999). Eli Lilly used this technology in partnership with Genentech to produce Humulin, the first of several recombinant insulins to be approved for general medical use (Walsh 2005).

Besides using bacteria as the host system, yeast has also been included in the commercial production of insulin. The predominant yeast strains used commercially are Saccharomyces cerevisiae and Pichia pastoris. Yeast-based expression system yields soluble insulin precursors which are secreted into the culture supernatant. There are different pros and cons to using bacteria and yeast as expression systems. Production of precursor insulin via the bacterial inclusion body route typically yields higher product concentration and productivity (Baeshen et al. 2014). Compared to soluble expressions of bioactive therapeutic proteins, the inclusion bodies have good mechanical stability and are resistant to proteolytic degradation (Singhvi et al. 2020). In terms of downstream purification, the protein of interest from inclusion bodies is also isolated in a purer and more concentrated state compared to secreted proteins which require the removal of abundant host cell protein impurities from the culture supernatant. The isolation of peptide from inclusion bodies also tends to be easier due to the differences in their size and density as compared with host proteins (Singhvi et al. 2020).

Using E. coli as the expression system for large-scale recombinant insulin production possesses the advantages of high growth rate, simple media requirement, ease of handling, high yield, and cost effectiveness (Baeshen et al. 2014). E. coli also has well-characterized genetics, and an availability of a huge quantity of mutant host strains and cloning vectors (Baneyx 1999). Nevertheless, the inclusion body route of production from E. coli requires complicated and extensive processing, such as solubilization and refolding procedures, to obtain fully functional polypeptides. Table 1 provides a list of insulin products currently on the market and their host systems. For most of the biopharmaceutical companies, E. coli remains the host system of choice to produce recombinant insulin, with human insulin and its analogues expressed in inclusion bodies in most instances. Insulin analogues are synthetically produced variations of insulin that have a different amino acid sequence to native human insulin (Chouhan et al. 2017). Such alterations have been made to more closely mimic the normal physiologic pattern of insulin secretion in the human body (Freeman 2009).

Table 1 Commercial recombinant human insulin/insulin analogue products and their production host systems

The single-chain insulin precursor molecules sequestered in E. coli inclusion bodies are mostly misfolded. The inclusion bodies are extracted from the cells, washed, and the precursor molecules are then solubilized. Production of proinsulin requires both peptide folding and concomitant formation of disulfide bonds. After enzymatic reaction, the refolded precursor molecule is then converted into a heterodimer insulin molecule by the removal of the C-chain and the N-terminus fusion peptide. Various chromatographic purification steps are required to remove host cell proteins, nucleic acids, cell membrane fragments, and digestion by-products, to yield a highly purified product. The general workflow for the downstream processing of recombinant human insulin and its analogues is summarized in Fig. 1.

Fig. 1
figure1

The general workflow for the downstream processing of recombinant human insulin and its analogues (created with BioRender.com)

The downstream process to purify recombinant human insulin/analogues from E. coli inclusion bodies is complex and involves multiple steps, and the specifics of the processes established by different companies are proprietary. The approaches adopted in each step will have a big impact on the next step in terms of product purity and yield. Here, we present a comprehensive review of downstream processing of recombinant human insulin/analogues production from E. coli inclusion bodies. All the critical aspects of downstream processing of recombinant human insulin/analogues produced in E. coli inclusion bodies will be summarized and discussed.

Recovery of inclusion bodies

The heterologous proteins are expressed in the form of insoluble cytoplasmic inclusion bodies and they are not being excreted into the culture media. Therefore, the inclusion bodies have to be recovered from the bacterial cells by either a mechanical or lysozyme-based method (Astolfi Filho et al. 2004; Singh et al. 2015). Mechanical methods to rupture the cell envelope involve using sonication, grinding the cell suspension in a colloidal mill such as a Dyno-Mill, or by passing the cell paste through a Manton-Gaulin press or French press. In a lysozyme-based method, the lysozyme is added to digest the cell envelope. Thereafter, the inclusion bodies, which are denser than the other cellular components, can be easily isolated from the whole cell lysate by techniques such as membrane microfiltration or centrifugation. Generally, mechanical methods can effectively disrupt cells, but may compromise the protein quality in inclusion bodies more than lysozyme-based methods (Singhvi et al. 2020). Neither centrifugation nor membrane filtration has any effect in the extraction and washing of inclusion bodies (Tikhonov et al. 2001).

Inclusion body washing

Inclusion bodies recovered from bacterial cell lysates can be heavily contaminated with intact whole cells, host proteins, RNA, peptidoglycan cell wall, and membrane fragments (Schoner et al. 1985; Singh et al. 2015). After lysis of the E. coli cells, the inclusion bodies have to be washed sequentially to remove contaminants which often have strong ionic and hydrophobic interactions with the inclusion body proteins (Astolfi Filho et al. 2004). Inclusion body washing is critical in recombinant insulin purification, without which numerous impurities will persist and may interfere with the following steps, such as sulfitolysis, renaturation, and enzymatic digestion (Min et al. 2011). This could lead to a reduction in purification yield.

Recovery of washed inclusion bodies is performed by multiple rounds of centrifugation combined with re-suspension and washing of pellets with detergents and denaturants. After the wash, the supernatant is discarded to leave behind the inclusion body pellet. Alternatively, instead of using centrifugation, isolation of inclusion bodies from bacteria can also be achieved via membrane filtration (Yuan et al. 2015).

A list of additives used in washing the proinsulin fusion protein-containing inclusion bodies are summarized in Table 2. The two most common wash buffer additives are urea and Triton X-100. The E. coli cell wall is constituted of phospholipid, protein, peptidoglycan, and lipopolysaccharide (Clark 1998). Selective extraction with detergents, low concentrations of urea, lysozyme, and EDTA facilitates the removal of these bacterial cell wall components. Typically, the pH of the wash buffers ranged from pH 7.4 to pH 8.0. Wash optimization is required to evaluate the efficacy of additives in removing impurities. It is also important to be mindful of ensuring the wash buffer does not inadvertently solubilize the desired recombinant protein, which may negatively impact its recovery. Other time and cost-saving optimization include number of centrifugation rounds, speed of centrifugation, and wash temperature.

Table 2 A list of additives, their specific functions, and pertinent examples of proinsulin fusion protein-containing inclusion body washes

Inclusion body solubilization and oxidative sulfitolysis

Inclusion bodies contain protein in a stable non-native conformation. The protein aggregates may be amorphous, with partial or complete denaturation (Astolfi Filho et al. 2004). Inclusion bodies are relatively insoluble in aqueous buffers and this has introduced substantial challenges in purification. Therefore, the washed inclusion bodies have to be solubilized with solubilization buffer solution to recover the recombinant protein.

Inclusion bodies are conventionally solubilized using high concentration of denaturants, such as guanidine hydrochloride (GdnHCl) and urea, which results in a complete disruption of protein structure (Singh et al. 2015). For proteins that contain numerous cysteine residues, dithiothreitol (DTT) or β-mercaptoethanol (BME) may be added to reduce incorrect disulfide-bond formation (Harrison et al. 2015). Triton X-100 and sodium dodecyl sulfate can be used to extract proteins from inclusion bodies (Astolfi Filho et al. 2004), though their usage is rather uncommon in proinsulin fusion protein solubilization.

Oxidative sulfitolysis involves the addition of –SO3 groups to the –SH of cysteine residues in proinsulin polypeptides to form protein-S-sulfonate in the presence of a high concentration of denaturing reagent (e.g., 6 M GdnHCl or 8 M urea) (Frank et al. 1981; Petrides et al. 1995). As proinsulin expressed in E. coli is not folded in the correct conformation, the sulfitolysis step is crucial to maintain its unfolded form (Petrides et al. 1995). This will prevent the formation of potentially incorrect disulfide bonds in solubilization and during initial purification steps preceding protein renaturation (Harrison et al. 2015). Furthermore, oxidative sulfitolysis can aid in improving the subsequent refolding yield (Min et al. 2011).

Some workflows would start with inclusion body solubilization first, followed by oxidative sulfitolysis (Min et al. 2011; Petrides 1995). The conditions for solubilization of proinsulin fusion protein-containing inclusion bodies (without concurrent oxidative sulfitolysis), ranked in order of ascending duration of solubilization, are presented in Table 3. The 8 M urea is the most common solubilization agent. Reducing agents (e.g., BME, DTT) are added to break the disulfide bonds. Table 4 presents examples in which inclusion body solubilization was carried out simultaneously with oxidative sulfitolysis in human insulin/analogue production, and these conditions are listed and ranked in order of ascending duration of reaction. Sulfitolysis of denatured proinsulin is carried out with sodium sulfite and sodium tetrathionate at a molar ratio of at least 2:1 (Table 4). In oxidative sulfitolysis, sodium sulfite assumes the role as a reducing agent. Sodium tetrathionate, as an oxidizing agent, is preferred over cystine, Ni2+ and Cu2+ ions, as the former requires a shorter reaction time to reach maximum yield (Min et al. 2011; Tikhonov et al. 2001). To achieve maximal reaction effectiveness, it is critical to use only freshly prepared tetrathionate (Tikhonov et al. 2001). If not, the formation of side products could affect the yields and subsequent folding step.

Table 3 Conditions for solubilization of inclusion bodies reported in human insulin/analogue downstream processing, ranked in order of ascending duration of solubilization
Table 4 Oxidative sulfitolysis conditions reported in human insulin/analogue downstream processing, ranked in order of ascending duration of reaction

pH and temperature are factors which could affect the solubilization and sulfitolysis rate of reaction. The solubilization and sulfitolysis reaction are typically carried out under alkaline conditions above pH 8 (Tables 3, 4). Min et al. (2011) and Kim et al. (2015) have reported using pH adjustment to 11 then to 9.5 to achieve higher protein solubility. The pH adjustment to 10.5 could attain complete protein solubilization (Zimmerman and Stokell 2010). Raising both the pH (from pH 9.5 to 11) and temperature (from 4 to 25 °C) of the sulfitolysis buffer will shorten the reaction time from 12 to 3 h (Min et al. 2011). The sulfitolysis reaction can be completed in 40 min at 37 °C, as compared to 2–3 h at room temperature (Tikhonov et al. 2001).

To achieve maximum yield, it is important to monitor the temperature and duration of the sulfitolysis reaction to ensure that the reaction goes to completion. However, going beyond what is required could be undesirable for the following fusion protein isolation and folding steps as the harsh conditions could lead to irreversible changes in protein structure (Tikhonov et al. 2001). Analysis of fraction aliquots on a Mono-Q column can be performed to monitor the progression of sulfitolysis reaction (Astolfi et al. 2004). A straightforward way to stop the reaction is by diluting the sample with water (Castellanos-Serra et al. 1996; Kim et al. 2015; Min et al. 2011; Nilsson et al. 1996) or freezing on dry ice (Cowley and Mackin 1997). Buffer exchange and desalting can be carried out to remove the sulfitolysis salts (see "Buffer exchange" section).

Cyanogen bromide (CNBr) cleavage

In E. coli protein translation, the recombinant protein must be translated as a fusion protein in which the N-terminal extension provides the initiator N-formylmethionine (fMet) (Laursen et al. 2005). fMet is a modified methionine used as the first amino acid in most bacterial proteins. Since fMet is recognized by the human immune system as a foreign body, it is important to remove it so as to avoid unwanted immunogenic reaction. To remove the signal sequence, the methionine linker of proinsulin can be cleaved off with cyanogen bromide (CNBr) before purification. In short, this process involves the suspension of protein sample in 70% formic acid and incubation with CNBr at room temperature in darkness for 12–16 h (Cowley and Mackin 1997; Mackin and Choquette 2003; Petrides et al. 1995). The disadvantages of this method are low cleavage specificity, prolonged evaporation of CNBr, high toxicity and volatility of CNBr, and possible chemical modifications of the released products (Mackin and Choquette 2003). An alternative method uses protein proteases to cleave the fusion proteins, though one has to be cautious to ensure that the protease cleaves at the correct site to remove N-terminal fused peptide.

Buffer exchange

Some of the reagents used in protein extraction, sample preparation, and purification may have adverse effects on protein function and stability. This could interfere with subsequent downstream processes. Therefore, it is necessary to remove or reduce these contaminants using one or more protein clean-up methods. The aim is to make the extracted or purified protein samples compatible with subsequent downstream applications. Here, we discuss the usage of buffer exchange techniques (i.e., dialysis, diafiltration, and size exclusion chromatography) in human insulin/analogue production.

Dialysis

Dialysis is the traditional method for desalting or buffer exchange, using osmotic pressure to drive solutes across a membrane (Merck Millipore 2021). The main disadvantages of dialysis are extended time taken to complete the exchange and the requirement of a large surface membrane area for exchange. It is however suitable for sensitive proteins which precipitate easily. In laboratory scale, the solution containing solubilized proinsulin-containing inclusion bodies can be dialyzed to eliminate urea (Nilsson et al. 1996).

Diafiltration

Diafiltration achieves desalting or buffer exchange through the use of centrifugal force or other external pressure to drive small microsolutes through a porous membrane (Merck Millipore 2021). The membrane does not allow macrosolutes bigger than the pore size to pass through. The main advantage of diafiltration lies in its ability to concentrate protein samples.

In the industry, diafiltration is applied after major reaction steps to remove interfering reagents. For example, diafiltration is applied after inclusion body solubilization and sulfitolysis reaction to remove the high concentration of urea and sulfitolysis reagents (Petrides et al. 1995). Besides that, the renatured samples can be concentrated and buffer exchanged by diafiltration into a suitable buffer for enzymatic conversion to occur (Astolfi et al. 2004; Nilsson et al. 1996; Petrides et al. 1995; Zimmerman and Stokell 2010). A recovery yield as high as 95%–98% has been reported with diafiltration (Petrides et al. 1995).

Size exclusion chromatography (SEC)

The third method for desalting and buffer exchange is to use size exclusion chromatography (SEC), also known as gel filtration chromatography. SEC separates molecules according to their relative sizes (Pharmacia Biotech 2021). Using a group separation technique, the small molecules, such as salts, can be separated from the larger peptides. SEC is a faster alternative to dialysis, and it requires a low dilution factor. For human insulin/analogue production, Sephadex G-25 resin has been widely used in desalting. It is useful for the removal of salts and other small impurities from molecules with molecular mass above 5000 Da (Cytiva 2021). The use of Sephadex G-25 for buffer exchange has been reported after sulfitolysis reaction (Cowley and Mackin 1997) and before proinsulin renaturation (Astolfi et al. 2004; Cowley and Mackin 1997). There was also a report using Sephadex G-25 after lysis, but the gel filtration media became heavily contaminated with non-chromatographable material that they were discarded after every experiment (Mackin 1999). It is also possible to use Sephadex G-25 after the RP-HPLC polishing step to exchange the sample buffer and to remove residual acetonitrile (Zieliński et al. 2019). Bio-Gel P2 was used to remove formic acid and remaining cyanogen bromide after fusion protein cleavage (Mackin and Choquette 2003).

Purification by chromatography and precipitation

Table 5 provides a summary of reported downstream purification schemes for recombinant human insulin/analogues produced in E. coli inclusion bodies. The entire purification process is based on a combination of different modes of chromatography which exploits differences in size, molecular charge, and hydrophobicity. After recovering fusion precursor peptide from inclusion body, the following capture step usually entails affinity chromatography (AC) or ion-exchange chromatography (IEX). It is important to ensure that the intermediate product is of high purity prior to enzymatic conversion step. IEX is also the chromatography of choice following protein renaturation and enzymatic conversion step. For insulin polishing step, it is common to use reversed-phase chromatography (RP) and size exclusion chromatography (SEC). Some of the purification strategies have incorporated folding and cleavage step early in the scheme, thus completely eliminating the need to purify insulin precursor. The renaturation of insulin precursor immediately following their recovery from inclusion body also eliminates the need to work with a high concentration of urea.

Table 5 A summary of reported downstream purification schemes for recombinant human insulin/analogues and their precursors produced in E. coli

Some of the more unconventional techniques in insulin downstream processing have been explored. The use of simulated moving bed (SMB), in place of traditional batch chromatography mode, has been reported for the purification of insulin (Wang et al. 2003). The SMB mode uses solid phase much more efficiently and require much less column volume for the same throughput. Furthermore, it can produce products at a purity similar to or higher than that of batch chromatography, and at substantially higher yields. There have been investigations on the use of protein-folding liquid chromatography (PFLC) to achieve simultaneous purification and renaturation of proinsulin (Bai et al. 2003; Yuan et al. 2015). A proof-of-concept using sub/supercritical fluid chromatography (SFC) to replace RP in insulin purification has been proposed (Govender et al. 2020). The SFC is more ecologically friendly as the major mobile phase, carbon dioxide, can be efficiently recycled. One of the SFC columns tested, the pentafluorophenyl column, was shown to yield a recovery of 84% in contrast to conventional RP-HPLC methods of ≥ 75%.

Affinity chromatography (AC)

AC is a separation technique based on specific binding interaction between an immobilized ligand and its binding partner. Depending on the specificity of the interaction, the degree of purification can be quite high. Some of the reported usage of AC in proinsulin purification is presented in Table 6. For AC to work, a suitable fusion tag has to be fused to the proinsulin molecule. Table 7 provides a list of fusion tags adopted in various insulin purification schemes and their advantages. Polyhistidine tag is the most commonly adopted fusion tag in insulin production. For affinity capture of His-tagged proinsulin, it is common to use nickel column with imidazole elution. Sample loading in the presence of a low concentration of imidazole and a moderate concentration of salt significantly reduced nonspecific binding of contaminating proteins (Mackin 1999). As high concentrations of urea are compatible with immobilized metal ion affinity chromatography (IMAC), it is useful to purify denatured proinsulin from inclusion bodies.

Table 6 Affinity chromatography in proinsulin (PI) purification
Table 7 A list of fusion tags used in insulin purification and their advantages

A well-designed fusion tag is desirable for both upstream and downstream processes. Besides the obvious function for purification, the fusion tag can improve the stability of recombinant human insulin/analogue. This is especially useful given that the proinsulin is a short-length peptide and is prone to degradation (Min et al. 2011). Other than that, an appropriately engineered fusion tag can achieve an improvement in proinsulin peptide expression levels (Min et al. 2011; Nilsson et al. 1996). The introduction of hydrophilic amino acids to fusion tag enhanced the solubility of the molecule and resulted in an improvement in renaturation yield downstream (Min et al. 2011; Nilsson et al. 1996).

Gene fusions may be tricky to handle in downstream processing, owing to the fact that the fusion tail has to be cleaved off and removed. To avoid using the toxic CNBr to cleave fusion proteins, it is possible to engineer lysine and arginine linkers that serve as trypsin cleavage site to join the fusion protein and proinsulin together. In this case, a simultaneous removal of both C-peptide and N-fused sequence can be achieved in just a single enzymatic conversion step (Min et al. 2011; Nilsson et al. 1996). If the fusion tag has a molecular weight and hydrophobic properties similar to the insulin molecule, after enzymatic conversion, the two molecules will be poorly resolved during analyses such as on RP-HPLC and SDS-PAGE. Finally, the cost of resin also constitutes an important factor for consideration. The AC resin is generally more expensive than IEX resin. It may be cost-effective to switch to IEX as the capture step if the latter has equal or better purification performance. Nonetheless, the fusion tag can still be preserved for the aforementioned benefits other than for affinity binding.

Ion-exchange chromatography (IEX)

IEX involves the separation of ionizable molecules based on their total charge. As the human insulin/analogues and its precursor have both anionic and cationic groups, it is possible to use anion exchange chromatography (AEX) and/or cation exchange chromatography (CEX) for purification. The significant negative charge of S-sulfonated fusion protein can bind strongly to AEX. However, negatively charged non-protein impurities such as nucleic acids can also compete for binding. Therefore, in some instances, it is advantageous to use CEX. Table 8 provides a list of applications which have reported using IEX in the purification of human insulin/analogue and its precursor. IEX is a versatile technique that can purify either a folded insulin/insulin precursor or a sulphonated proinsulin in denaturing condition. Commonly, IEX is adopted as the capture step of precursor insulin or as the first step after enzymatic cleavage. Its high binding capacity and the possibility of including multiple wash steps enable the removal of a large number of contaminants which may complicate subsequent purification. These contaminants comprised of host cell impurities (protein, DNA, endotoxin), proteolytic enzymes, and product impurities produced during enzymatic digestion (excised fragments and miscleavages).

Table 8 Ion-exchange chromatography (IEX) in proinsulin (PI) and insulin purification

IEX is characterized by a charged surface on stationary phase, and the use of buffer, salt, and pH control on mobile phase composition. After the impure protein sample was loaded onto an IEX column, the column is washed to remove undesired proteins and other impurities. Using either a salt or pH gradient, the protein of interest is then eluted. It is more common to elute the insulin precursor/insulin with sodium chloride compared to using a pH gradient (Table 8). In salt gradient elution, the salt ions compete with bound proteins for the charged functional groups on the resin. Proteins with many charged groups will elute at high salt concentrations and thus have greater retention times. On the other hand, proteins with few charged groups will elute at earlier retention times. In pH gradient elution, the number and type of ionizable amino acid side chain groups will determine the charge on the protein. At the protein’s isoelectric point (pI), the net charge of the protein becomes zero. Protein elution occurs at the point when the pH gradient meets their pI as the protein no longer have a net charge that allows binding to the resin. An increasing pH gradient is used to elute protein from a cation exchange resin, whereas a decreasing pH gradient is used to elute proteins from an anion exchanger.

It is important to take into consideration the pI of the insulin or insulin analogue during IEX purification as it will influence the choice of resin. Insulin analogue may have a pI that is different from the native insulin. For example, the pI of native insulin is 5.4–5.6; however, the pI of an acid-stable insulin analogue (e.g., insulin glargine) is about 6.7–7.0 (Coleman et al. 2019). A weak cation exchanger may be applicable for insulin analogues with a pI greater than the pI of native insulin, whereas a strong cation exchanger is used for insulin analogues with a pI similar to that of native insulin (Coleman et al. 2019). A strong cation exchanger is ionized across a wide range of pH levels, whereas a weak cation exchanger is ionized within a narrower pH range and they start to lose their ionization below pH 6.

The stability and solubility of insulin precursor/insulin is another factor for consideration. Insulin/insulin analogue is soluble in acidic condition, but it has a limited solubility close to its pI and neutral pH (Sigma Aldrich 2021; Wintersteiner and Abramson 1933). In addition, the insulin molecule is unstable at both the extremes of pH. At a high pH, there is an increased risk of deamidation and aggregation (Helmerhorst and Stokes 1987). On the other hand, a pH that is overly acidic also poses the risk for deamidation (Brange and Langkjœr 1993). Hence, a suitable pH range for purification of insulin with CEX is typically set at around pH 3 to 4. In a salt gradient elution, the pH can be used to refine the eluted peak resolution. An optimal pH should provide maximal binding of insulin peptide and minimal binding of contaminants.

A vital consideration for IEX purification of insulin precursor/insulin lies in the buffer composition during loading, washing, and elution. Acetic acid is a preferred buffer in CEX as it has been shown to reduce the risk of insulin fibrillation (Whittingham et al. 2002). The addition of organic solvents/modifiers to the eluent can help to maintain good protein solubility. For example, the addition of ethanol may increase the solubility of insulin as the molecule is relatively hydrophobic (GE application note 29-0018-56 AB 2012; Mollerup and Frederiksen 2016). The presence of ethanol may improve chromatographic performance, such as by decreasing elution volume and increasing yield (Heldin et al. 2014). The wash buffers should be optimized with respect to pH and salt content to ensure maximal removal of impurities.

Some impurities may be very challenging to remove, especially if they carry the same charge as the target peptide. It has been found that in the presence of di- or polyvalent metal ions in binding buffer, the insulin peptides are capable of self-association or having structural change (Mollerup and Frederiksen 2016). This results in an improvement in the control of peak shapes and leads to a good resolution of the protein of interest from closely related impurities. For example, the addition of Zn2+ to the binding buffer could result in a fronting peak shape of the insulin peptide, which have moved away from the earlier eluting insulin-related impurities (Mollerup and Frederiksen 2016). Also, a steep insulin peak tail makes it possible to obtain a very concentrated pool of purified insulin. This is advantageous in industrial production scale, in which the load can be increased while maintaining the same capacity to remove impurities.

Reversed-phase chromatography (RP)

RP involves separation of molecules based on the hydrophobic interactions between ligands attached to the stationary phase and solute molecules in the mobile phase. There is a difference in mechanism by which polypeptides interact with the RP surface, as compared to the small molecules (Carr 2016). In the separation of small molecules, there is continuous partitioning of the molecules between the mobile phase and the hydrophobic stationary phase. On the other hand, polypeptides are too big to partition into the stationary phase. The hydrophobic “foot” of the polypeptide remains adsorbed to the hydrophobic surface of the stationary phase up to the point a specific organic modifier concentration is reached, and this will desorb the polypeptide.

Table 9 presents reported applications of RP in the human insulin/analogue purification scheme. RP is usually placed after the enzymatic conversion of proinsulin to insulin. Following enzymatic digestion, the purification of insulin or insulin analogues usually requires two to three orthogonal chromatographic purification steps (Table 5). Purification with RP complements IEX and SEC by providing selectivity based on differences in hydrophobicity.

Table 9 Reversed-phase chromatography (RP) in proinsulin (PI) and insulin purification

RP is typically placed late in the overall purification scheme (e.g., after IEX) to allow the preceding step to remove a majority of the host cell-related impurities. This will be beneficial in prolonging the effective lifetime of RP stationary phase as it is prone to fouling as a result of irreversible binding or low solubility of contaminants (Kroeff et al. 1989). This is critical as the silica-based packings have pH limitations which prevent the use of pH extremes for chromatography clean-up and regeneration.

The high selectivity and resolving capability of RP is the foundation for its widespread use as a “polishing” step in the purification of human insulin/analogue. A key challenge with the purification of human insulin/analogues is the downstream elimination of process-related impurities arising from insufficient cleavage of the insulin precursor molecule or miscleavage. Such product impurities include C-peptide, N-terminal signal sequences, dipeptides, aggregated insulin, misfolds, miscleaves, deamidated insulin, and any residual proinsulin (Watson et al. 2018). Often, minor modifications on the insulin molecule result in the generation of A21 desamido insulin (hydrolysis of A21 asparagine to aspartic acid), B30 des-threonine insulin (deletion of B30 threonine), or derivatization of amines by formylation or carbamoylation (Petrides et al. 1995). The most significant ones among these contaminants are A21 desamido insulin and B30 des-threonine insulin (Balcerek et al. 2018). As des-threonine insulin carries the same net charge as insulin, IEX is not ideal in resolving the two molecules. Nonetheless, the molecules’ small differences in hydrophobicity make RP useful in removing the contaminant. The resemblance in chemical structure between human insulin/analogue and its product-related impurities, which may differ by only a single amino acid, has posed an uphill task in the purification of human insulin/analogue. However, there have been many reports which have demonstrated the capability of RP to resolve such close-related insulin variants and fragments generated after trypsin digestion (Table 9).

RP ligand with linear aliphatic hydrocarbon chain of eighteen (C18), eight (C8), or four (C4) carbons can be used in the purification of human insulin/analogue, with the most common one involving C8, followed by C18 and C4 (Table 9). C18 is usually preferred for small hydrophilic peptides in the range of 2–3000 Da, whereas C4 is most suitable for the separation of proteins, and large or hydrophobic peptides (Carr 2016). The size of insulin (at about 6000 Da) after digestion and its relative hydrophobicity makes C8 column, which is an intermediate between C4 and C18, ideal for insulin separation. It is best to try several different hydrophobic phases to ascertain which has the best selectivity for that specific mixture of peptides generated from protein digestion. Variation in the types of insulin analogues and fragments generated post trypsinization, may result in some differences in their “hydrophobic foot”. Also, subtle changes in RP adsorbent surfaces may result in differences in RP selectivity (Carr 2002).

Due to the highly hydrophobic nature of RP adsorbents, an organic solvent as the mobile phase has to be used in peptide elution. The ability of human insulin/analogue to tolerate such a harsh elution condition makes RP a suitable tool for purification. The organic solvent most commonly used in human insulin/analogue RP purification is acetonitrile (Table 9). Its long history of use, high volatility and thus easy removal after purification, low viscosity and thus low back pressure, transparency to low wavelength UV light, and ability to provide good chromatographic selectivity and peptide solubility make it preferable over many other solvents such as ethanol, isopropanol, and acetone (Carr 2016; Kroeff et al. 1989). There are also reports on the usage of isopropanol for insulin purification (Dickhardt et al. 1993; Dickhardt and Unger 1997; Mikiewicz et al. 2017; Watson et al. 2018). Isopropanol, which is less polar than acetonitrile, may enhance recovery or elution of hydrophobic insulin molecule.

The RP mobile phases used in insulin purification are generally adjusted to a low pH. At a low pH, protonation of the carboxylic acid groups on the side chains of glutamic and aspartic acids, and on the carboxy terminal group will occur and this will make the protein more hydrophobic (Carr 2016). This will increase the retention of peptides. Also, an acidic mobile phase typically elutes all the insulin derivative impurities after the insulin peak and this will simplify the collection step (Petrides 1995). In contrast, a buffer with mildly alkaline pH causes the elution of derivatives to be on either side of the parent insulin peak, thus sacrificing on the yield recovery and purity of the insulin pool.

An ideal pH for human insulin/analogue purification on RP is in the range of 3.0–4.0, as this pH range is under insulin’s isoelectric point to provide for good solubility (Kroeff et al. 1989; Ladisch and Kohlmann 1992). In addition, the mildly acidic conditions will reduce the formation of monodesamido (B-3) insulin generated at pH 7 or slightly above (Brange 2012; Kossiakoff 1988). As the typical pH range for RP on a silica-based packing is pH 2–8 (Verma 2021), the mildly acidic mobile phase will be within the pH tolerance of stationary phase. Stationary phase stability is critical in avoiding contamination of the human insulin/analogue with these potential breakdown products and ensuring a good column lifetime (Kroeff et al. 1989). Despite the advantages of using mildly acidic conditions, it is critical to perform the RP purification as rapidly as possible and avoid prolonged storage of human insulin/analogue in acidic condition to minimize deamidation (Ladisch and Kohlmann 1992).

It is important to select an appropriate acidic buffer as mobile phase in insulin purification. Insulin can form fibrillation (elongated insoluble aggregates) at a high salt concentration (Chatani et al. 2014; Muzaffar and Ahmad 2011). Thus, the mobile phase should be capable of maintaining a stable pH without the addition of salts. Acetic acid-containing mobile phase is a good option, because acetic acid is a weak acid that buffers in the region of pH 3 without the need for other salts (Kroeff et al. 1989; Petrides et al. 1995). Also, insulin can be readily crystallized from the acetic acid buffer by adding zinc chloride. A proprietary ion-pairing chromatography method on RP was reportedly able to improve the recovery of pure insulin collected as the des-threonine insulin impurity is displaced away from the insulin peak (Kromasil 2021).

It is also common to elute human insulin/analogue and impurities by a gentle linear change in gradient compared to using an isocratic elution (Table 9). This is due to a few reasons. First, the sensitivity of polypeptide retention to subtle changes in the organic modifier concentration makes isocratic elution challenging, due to the fact that any small changes in modifier concentration can significantly affect protein retention (Carr 2016). To separate impurities from the insulin peak in isocratic elution, the concentration of the organic modifier must be maintained very precisely in every batch. Second, a gradient elution of polypeptide yields much sharper peaks than in an isocratic elution (Carr 2016). A concentrated pool obtained from gradient elution makes handling and processing of human insulin/analogue much easier. Finally, a gradient elution can minimize the run time as compared to using isocratic elution in RP (Healthcare GE 2006).

Hydrophobic interaction chromatography (HIC)

HIC, similar to RP, separates molecules based on their hydrophobicity. However, unlike RP, HIC adsorbents are not as hydrophobic. Thus, organic solvents are not a prerequisite for successful elution. HIC is useful for purifying sensitive proteins as their biological activities are preserved under less denaturing chromatographic conditions.

There are many methods for HIC elution optimization—linear or stepwise decreasing gradient of a salt and/or zwitterion, a pH gradient, a temperature gradient, or a combination of the before-mentioned (Mollerup and Frederiksen 2016). It is also possible to adopt an elution gradient of a calcium chelating compound (e.g., citrate, EDTA, and malonate) or a solvent less polar than water (e.g., aqueous solutions comprising ethanol, 2-propanol, and polyethylene glycol (PEG)) (Mollerup and Frederiksen 2016). Using the salting-out effect, the retention of peptide can be modulated by adjusting the salt concentration (Johansson et al. 2015). It is however important to select a suitable salt for proinsulin purification on HIC, as the type of salt chosen for binding and elution can have an effect on product yield and purity (Bai et al. 2003).

Some of the example applications of HIC in proinsulin purification are presented in Table 10. As a gentler purification technique as compared to RP, it may be suitable to use HIC following proinsulin renaturation, so that the peptide conformation is maintained (Petrides et al. 1995). The use of HIC for a simultaneous purification and refolding of proinsulin has also been reported (Bai et al. 2003).

Table 10 Hydrophobic interaction chromatography (HIC) in proinsulin (PI) purification

Compared to RP, there are only few reports available on the use of HIC in human insulin/analogue production. Human insulin/analogue polishing step requires an extremely high-resolution separation of closely related impurities that have only minor differences in hydrophobicity. This may make RP preferable over HIC as RP is known to be the highest resolution chromatography technique available (Healthcare GE 2006). To ensure insulin adsorption on a HIC column, the feed should have the same high salt concentration as the mobile phase during elution. Therefore, another disadvantage of HIC is that diluted large insulin feed volume had to be used as the insulin solubility was reduced due to the high salt content in the feed. It has been reported that column load for HIC with butyl adsorbent was 0.01–2.4 g insulin/L column and 0.5–1.3 g insulin/L column for the phenyl adsorbent, which paled in comparison with the RP columns (2.9–11.7 g insulin/L column) (Johansson et al. 2015).

Size exclusion chromatography (SEC)

SEC, also known as gel filtration, is the mildest of all the chromatography techniques. SEC separates molecules by differences in size as they pass through the resin. There is no binding of the molecules to the chromatography resin, unlike chromatography techniques such as AC and IEX. SEC is typically used in the polishing step in human insulin/analogue purification (Table 5). It has also been adopted in protein-folding liquid chromatography (PFLC) for simultaneous purification and renaturation of proinsulin (Yuan et al. 2015). During elution with refolding buffer, the folded proinsulin was separated from the misfolded and aggregated forms. Table 11 presents examples of reported application of SEC in proinsulin and human insulin/analogue purification.

Table 11 Size exclusion chromatography (SEC) in proinsulin and insulin purification

Mixed-mode chromatography (MMC)

In recent years, MMC is becoming increasingly popular in human insulin/analogue purification. MMC utilizes more than one form of interaction between the stationary phase and analytes to achieve their separation (Zhang and Liu 2016). The multiple modes of interaction include affinity, ion exchange, size exclusion, hydroxyapatite, and hydrophobic interactions. The mixed-mode ion exchangers give a different selectivity as compared to the conventional ion exchangers. It also enables the possibility of proteins binding at a high salt condition, which eliminates the need for desalting or dilution prior to carrying out MMC. Table 12 presents examples of reported application of MMC in proinsulin and human insulin/analogue purification.

Table 12 Mixed-mode chromatography (MMC) in proinsulin (PI) and insulin purification

pH precipitation

The precipitation method has been adopted in human insulin/analogue downstream processing for a crude and rapid separation of human insulin/analogue from other impurities. This method can also be used to concentrate the protein for further purification steps.

Purification and concentration of the solubilized protein from inclusion bodies can be achieved by pH precipitation. As a protein has the lowest solubility at its isoelectric point (pI), it will precipitate out of the solution at its pI. Following that, centrifugation can be performed to recover the protein of interest. For example, solubilized fusion protein can be precipitated by lowering the pH to 5.0, centrifuged, and then re-dissolved in an alkaline buffer at pH 8.5 to enhance the purity of the solubilized protein (Astolfi et al. 2004). After sulfitolysis reaction, the S-sulfonated preproinsulin can be precipitated by adjusting the pH to 4.5, centrifuged, and the washed pellets can be solubilized again by raising the pH to 10.6 (Kim et al. 2015; Min et al. 2011). The pH precipitation step can also be carried out after proinsulin refolding (Hwang et al. 2016; Min et al. 2011; Thurow et al. 2010). The pH of the refolded proinsulin solution can be lowered to pH 4.5–5.5 to precipitate out the impurities, misfolded proinsulin, lipids, and other proteins that hinder subsequent enzymatic conversion and chromatography purification. The supernatant containing correctly folded proinsulin is collected by centrifugation. It is important to determine the exact pH required to precipitate the impurities, while maintaining the correctly refolded proinsulin in solution.

Zinc crystallization

In the presence of zinc ions, insulin is capable of self-association to form hexamers, di-hexamers, or bigger complexes of insulin peptide (Mollerup and Frederiksen 2016). The hexamer consists of three dimers closely associated through interacting with two zinc ions at its core positioned on the threefold axis. The aptitude of insulin to form crystals readily can be utilized in the purification scheme to isolate insulin from impurities that do not co-crystallize with it (Mollerup et al. 2010). Zinc crystallization is a convenient and efficient method to concentrate and hold the insulin temporarily prior to further processing (Kroeff et al. 1989). The hexameric form also protects insulin from physical and chemical degradation during storage (Mollerup and Frederiksen 2016). In proinsulin purification, the addition of 0.1 M to 0.5 M zinc chloride solution in between chromatography purification steps has been reported to precipitate out the eluted proteins (Redwan et al. 2007). After decitraconylation, 1 M zinc chloride solution was added to the insulin-containing sample and the pH was adjusted to 5.9 to precipitate out insulin (Zieliński et al. 2019). Zinc crystallization can be performed as soon as possible after the RP purification step to remove insulin from the acidic environment to minimize deamidation (Brange 1994).

Refolding

The insulin hormone is comprised of two polypeptide chains, an A chain of 21 amino acids and a B chain of 30 amino acids. These two chains are connected by two interchain disulfide bridges (A20-B19 and A7-B7). The A chain also features one intra-chain disulfide bridge (A6-A11). In vitro, the proinsulin is believed to spontaneously fold into its native structure through preferred kinetic intermediates (Jia et al. 2003). The refolding pathway of human proinsulin has been studied by capture and analysis of their folding intermediates. Although several intermediates with three disulfide bonds have been identified and may interconvert via disulfide rearrangements (Qiao et al. 2003; Hua et al. 2002), the formation of the A20-B19 disulfide bridge appears to be the key step in refolding process and is critical to maintain the native structure of the proinsulin (Guo et al. 2008; Qiao et al. 2006). Unsurprisingly, a study on three disulfide deletion mutants of human insulin revealed that the deletion of the A20-B19 disulfide bond had the greatest influence on the protein structure, while the deletion of A6-A11 disulfide bond had the least effect (Chang et al. 2003).

During in vitro renaturation, the high concentration of denaturant has to be removed to facilitate disulfide-bond formation and correct folding of proinsulin to its native form (Harrison et al. 2015). Dilution of the solubilized protein in refolding buffer is by far the most popular choice in insulin production. Other methods to refold proinsulin include protein-folding liquid chromatography (PFLC) (Yuan et al. 2015) and dialysis (Mackin 1999). The optimized conditions reported for proinsulin refolding via dilution are summarized in Table 13. The success of refolding depends in part on the composition of refolding buffer. Glycine–NaOH is the most common buffer used in proinsulin refolding as it buffers in the range of pH 8.6–10.6. The high pH leads to faster disulfide-bond formation and shuffling (Min et al. 2011). Refolding performed at pH 10.5 using 0.5 mM cysteine and 4.5 mM cystine as redox couple yielded a higher refolding yield (60%) compared to refolding at a lower pH of 8 which yielded only a 20% yield (Winter et al. 2002b).

Table 13 Reported optimized conditions for proinsulin (PI) refolding via dilution

To improve proinsulin refolding yield, chaotropic agents (e.g., urea and guanidine hydrochloride) and amino acids may be added to the refolding buffer. Arginine, low concentration of urea, and guanidine hydrochloride were reported to increase the solubility and suppress precipitation during proinsulin refolding (Qiao et al. 2003; Winter et al. 2002b). Although arginine and urea were both used to assist in proinsulin refolding, their efficacy and possible mechanism was found to be different (Chen et al. 2016). The oligomers formed with urea were of larger size than with arginine. It is important to optimize the concentrations of these reagents during the protein refolding process. The presence of 1.5 M guanidine hydrochloride in the refolding sample may reduce the aggregation of folding intermediates and this may be beneficial when folding was performed at a higher protein concentration of 1–2 mg/mL (Winter et al. 2002b). With the urea concentrations increasing from 2 to 4 M, the refolding yield of proinsulin aspart decreased from 40 to 30% due to the increase of disulfide-isomerized monomers (Chen et al. 2016). On the contrary, the refolding yield gradually increased to 50% with increasing arginine concentrations up to 1 M. Castellanos-Serra et al. (1996) have found that urea and guanidine hydrochloride at low molar concentrations (0.5 M–2 M) had no substantial effect in folding yield of interleukin-2-proinsulin. Hydrogen peroxide, an uncommon component in refolding buffer, is shown to minimize the formation of insulin derivatives and resulted in a simultaneous improvement of the human insulin production yield (Son et al. 2008).

It was found that protein hydrophobicity could have an influence on their sensitivity against redox reagents (Min et al. 2011). The refolding process depends largely on the redox potential. It was found that all three ratios of reduced and oxidized glutathione at 2:1, 1:1, and 1:2 resulted in complete refolding of DKP-human proinsulin (Mackin and Choquette 2003). Winter et al. (2002b) found that the yield of refolding increased as the refolding buffer became more oxidizing, possibly attributed by less formation of intermediates susceptible for aggregation. Refolding with cysteine/cystine yielded about two times more native proinsulin and the process was more efficient compared to folding in the presence of glutathione. Optimum renaturation with cysteine/cystine took place under more oxidizing conditions than with GSH/GSSG. The type and concentration of oxidant, reductant, or redox couple should be optimized to get a high refolding yield. It was demonstrated that only oxidant, instead of redox pairs, was necessary to facilitate the native disulfide-bond formation for the reduced denatured proinsulin (Chen et al. 2016). An oxidative agent of selenocystamine could improve the proinsulin refolding yield up to 80% in the presence of 0.5 M arginine. BME is a common reducing agent added for appropriate disulfide-bond formation (Kyte and Doolittle 1982; Petrides et al. 1995), and its concentration has to be optimized to assure a high refolding yield (Min et al. 2011). At a higher cysteine-to-proinsulin-SH ratio, some intermediates and folded lyspro-proinsulin were reduced to proteins with thiolate anions, which caused unfolding and high yield loss (Chen et al. 2010).

It is important to reduce the intermolecular interactions during refolding to minimize protein aggregation, and this will lead to an increase in refolding yield (Chen et al. 2010; Singh et al. 2015). Several studies have shown the effects of protein concentration on refolding yields (Chen et al. 2016; Kim et al. 2015; Mackin and Choquette 2003). It is recommended to dilute proinsulin to under 1 mg/mL to prevent cross-folding of molecules (Petrides et al. 1995). It was found that the interleukin-2-proinsulin refolding yield at 800 µg/mL was 15% of the yield at 200 µg/mL, in which the latter has a refolding yield of about 80% of correctly folded proinsulin (Castellanos-Serra et al. 1996). Refolding of DKP-human proinsulin proceeded similarly well at 50, 100, and 200 μg/mL, though a concentration at 400 μg/mL failed to result in complete refolding (Mackin and Choquette 2003). A maximum refolding yield at 50% was obtained when the proinsulin protein concentration was kept at 0.5 mg/mL, compared to 30–40% yield at a protein concentration of 1–2 mg/mL (Winter et al. 2002b).

There is a wide range of temperatures (2–25 °C) reported in proinsulin refolding (Table 13). Although a change in temperature could influence the kinetic of the formation of native proinsulin, the proinsulin refolding process was not temperature-dependent and temperature had no significant effect on the final yield (Winter et al. 2002b; Zieliński et al. 2019). This could be attributed to the good thermal stability that insulin has against thermal unfolding (Winter et al. 2002b).

The refolding duration varies from as short as 10 min to more than 48 h (Table 13). The proinsulin refolding process can be monitored to completion by analyzing aliquots of the reaction mixture on RP-HPLC. Once the steady yield was reached, it would not increase or drop further as the reaction time proceeds (Castellanos-Serra et al. 1996).

Min et al. (2011) has demonstrated that the structure of leader peptide attached to proinsulin could affect the refolding yield, because the leader peptide affects protein conformation and hydrophobicity. For example, a high refolding yield of 85% was achieved by one of the leader peptide’s high hydrophilic character.

There has been a report of using PFLC SEC column chromatography to refold the recombinant human proinsulin (Yuan et al. 2015). The propensity for protein aggregation is reduced by the gradual removal of the denaturant during buffer exchange while passing through a column (Batas and Chaudhuri 1996). Small molecules, such as urea and dithiotheritol (DTT), enter the pores of the resin and are separated from the unfolded protein molecules. The protein will start to fold into compact and partially folded conformation that can enter the resin pores as the chaotrope concentration around the unfolded species is reduced. Compared to PFLC, the dilution method of refolding proinsulin is more straightforward as it does not require the use of expensive membrane or chromatography column. The main disadvantage of using dilution method is a high buffer requirement especially for large-scale operation. PFLC-based protein refolding process may be more efficient as the protein is being refolded continuously with high productivity and low consumption of the refolding buffer.

There is no universal method to achieve high refolding efficiency and yield. It is important to optimize the refolding buffer compositions, protein concentrations, and conditions for each product individually, so as to assure the most economical and proper refolding process.

Enzymatic cleavage

Insulin is synthesized in the human body as a single-chain precursor proinsulin (preproinsulin) consisting of chain A-, B-, and C-peptide (Mollerup and Frederiksen 2016). The formation of mature insulin requires the removal of C peptide by cleavage of Arg–Arg and Lys–Arg, leaving chain A and B connected by disulfide linkages.

Similarly, in “proinsulin” route of insulin production, the recombinant precursor human insulin/analogue molecules have to be enzymatically cleaved to remove the C-peptide to produce human insulin/analogue heterodimers. In insulin production, enzymatic cleavage most frequently consists of using two enzymes, trypsin and carboxypeptidase B (CPB). Trypsin is a serine endopeptidase which cleaves at the carboxy terminal of internal arginine and lysine residues (Harrison et al. 2015). The exopeptidase carboxypeptidase B (CPB) removes basic amino acids, such as arginine, lysine, and histidine, on the C-end as a result from trypsin activity (Balcerek et al. 2018). Table 14 presents a list of optimized conditions reported for enzymatic conversion of proinsulin to produce human insulin/analogues. A wide range has been reported for the mass ratio of proinsulin to enzyme, reaction temperature and duration. Here, we see that there is no fixed protocol to follow and optimization is required for every insulin purification scheme.

Table 14 Reported optimized conditions for enzymatic conversion of proinsulin (PI) to human insulin/analogue

The leader peptide and C-peptide can be concomitantly cleaved off by trypsin if arginine and lysine linkers are designed to attach the leader peptide to proinsulin (Min et al. 2011). The enzyme reaction rate can be influenced by the type of leader peptide linker that is connected to B chain. It has been found that the enzyme reaction rate is almost doubled when an arginine, instead of a lysine linker, is used (Min et al. 2011). Other than that, the hydrophobicity and length of the leader peptide can affect the tertiary structure of fused proinsulin, which can have an impact on the rate of enzymatic reaction (Min et al. 2011). The enzymatic reaction becomes quicker when there is more exposure of the cleavage site. Thus, it is important to evaluate the rate of enzymatic reaction and conversion yield when designing fusion partners for proinsulin, as the structure of the proinsulin molecule can have an effect on the rate of conversion.

The enzymatic digests introduce impurities, such as A21 desamido insulin, B30 des-threonine insulin, insulin ethyl ester, and arginine- and diarginine-insulin (Balcerek et al. 2018; Coleman et al. 2019). The most significant ones among these contaminants are A21 desamido insulin and B30 des-threonine insulin. The reaction temperature and the ratio of trypsin to CPB are both significant factors to monitor so as to reduce the formation of B30 des-threonine insulin (Min et al. 2011). Modification of trypsin/substrate ratio was found to result in significant changes in HPLC profiles of the digest products (Castellanos-Serra et al. 1996). Fluctuations in temperature between 4 and 37 °C have a strong influence on the rate of enzymatic digestion, with only a minimal effect on the proportion between human insulin and des-octa-insulin generated.

It is also important to monitor the entire length of enzymatic reaction, and to quench it at the maximal insulin point, because the B30 des-threonine insulin amount could increase with time (Min et al. 2011). The reaction time was molecule-dependent. It was found that the C-peptide was completely removed after 10 min of trypsinization, but the N-terminal extension of interleukin-2-proinsulin was completely removed after prolonged reaction of 5 h (Castellanos-Serra et al. 1996). The digestion can be monitored by RP-HPLC to determine the end point of reaction. There are a few methods to quench the enzymatic reactions. As the tryptic conversion of proinsulin to insulin is performed at around pH 7.5, the first method is to titrate down the pH of the reaction mixture to pH 2.5–3.5 by adding glacial acetic acid (Zimmerman and Stokell 2010) or trifluoroacetic acid (Nilsson et al. 1996). This is a suitable pH for directly loading the digest pool unto CEX, as the digest pool is at a pH lower than the pI of the properly folded insulin/insulin analogue. The second method involves the addition of a protease inhibitor, such as aprotinin, to the digest pool to stop the enzymatic digestion prior to chromatographic purification (Watson et al. 2018). This method is useful if the next column after digestion is an AEX/MMA, because there is no need to titrate down the pH. There has also been a report of using isopropanol (final concentration: 40%) to stop the cleavage (Bai et al. 2003).

Prior to proteolysis reaction, citraconylation can be performed to regulate enzymatic digestion and to avoid insulin degradation (Mikiewicz et al. 2017; Zieliński et al. 2019). The substrate-binding site of bovine trypsin has a negatively charged Asp-189 residue, which can have ionic interactions with the substrate through their positively charged amino acid side chains (Evnin et al. 1990; Gráf et al. 1988). The trypsin cleavage sites can be altered by the addition or deletion of positive charges, as trypsin only cleaves peptide bonds behind positively charged residues. In citraconylation, the insulin precursor is treated with citraconic or maleic acid anhydride, in which the lysine residue (LysB29) is blocked by acetylation reaction. As a result, its positive charge is converted into negative charge and thus prevents the hydrolysis of the peptide bond from C-end of lysine in B chain. The blocking of lysine and arginine residues by citraconic anhydride is reversible to allow for a controlled tryptic digestion (Zieliński et al. 2019). Citraconic anhydride blocks primary amine groups at alkaline pH values above 8. At an acidic pH under pH 4, the amide linkage is hydrolyzed and citraconic acid is released to yield the original amine (Zieliński et al. 2019).

A typical citraconylation protocol in insulin production entails incubation of the solution in pH 8.4–9.3 with stirring at 25 °C for 2–2.5 h (Balcerek et al. 2018; Hwang et al. 2016; Zieliński et al. 2019). After precursor proteolysis, decitraconylation is carried out by incubation of proinsulin in pH 2.5–3.0 in the cold or room temperature for several hours (Balcerek et al. 2018; Hwang et al. 2016; Son et al. 2009; Zieliński et al. 2019).

The correctness of citraconylation and trypsinization reactions can be verified by mass spectrometry (Zieliński et al. 2019). Citraconylation is able to reduce the amount of several types of digest contaminants. As the presence of B30 des-threonine insulin contaminant is a result of B30 threonine elimination by trypsin, citraconylation was able to reduce the amount of this contaminant by 5–10% (Balcerek et al. 2018). Citraconylation of lysine residues in human proinsulin blocked trypsin cleavage, reduced the formation of des-threonine insulin from 13.5 to 1.0%, and hence increased the production yield of active insulin (Son et al. 2009). Citraconylation minimized the formation of Arg (B31)-insulin and raised the enzymatic conversion yield of glargine insulin by 3.2-fold, as compared to without citraconylation (Hwang et al. 2016).

To eliminate or reduce certain unwanted cleavage by-products, enzymes such as enterokinase, Asp-N endoproteinase, and Kex2 protease have been explored in insulin purification (Balcerek et al. 2018). In particular, Kex2 protease has a high specificity and good proteolytic stability even during extended incubation, in which this is an advantage over commonly used enzymes (Balcerek et al. 2018). Due to the specificity of enzyme action, it may be necessary to introduce modifications in the proinsulin peptide sequence to direct cleavage at the desired site if there is no naturally occurring cleavage site recognizable by the enzyme (Balcerek et al. 2018). For example, after switching the lysine residue with an alanine residue at position 64 of the native proinsulin sequence, the theoretical enzyme reaction yield increased, because the formation of arg-AO-insulin during the trypsin cleavage was prevented (Zimmerman and Stokell 2010).

The conditions adopted in enzymatic conversion can have an effect on the quantity and variety of contaminants generated. It is important to optimize the cleavage step to increase the yield of properly cleaved human insulin/analogue and to reduce the amount of undesirable digest by-products. A higher purity achieved in the digest pool will simplify subsequent purification steps, thus leading to greater cost savings in production.

Formulation

After final chromatographic purification, the product must be conditioned for subsequent formulation into a drug product that is stable during storage. The main risk factors involved when handling the product after the final chromatographic purification are deamidation of the insulin molecule and formation of aggregates/fibrils (Brange 1994). Aggregation can occur after stressing the molecule by agitation, low pH, high temperature, high protein concentration, high ionic strength, and in the presence of denaturants or high concentration of organic solvent (Nielsen et al. 2001; Tiiman et al. 2013). Deamidation can occur when there is prolonged exposure to acidic conditions to form A21 desamido insulin (Mollerup et al. 2010). Thus, to prevent insulin fibrillation and deamidation, residual salts and buffers from chromatographic purification must be removed via crystallization and lyophilisation. The insulin crystals can be isolated by centrifugation, decantation or filtration, and eventually, they are washed to remove the residual zinc content. After zinc crystallization, the next step that follows is freeze-drying (Coleman et al. 2019). The recombinant product will be solubilized when ready for packaging (Zimmerman and Stokell 2010).

Specific additives can be added to the formulation to prevent bacterial growth and protein aggregation, or to modify the in vivo absorption kinetics of insulin. For example, m-cresol could be added to suppress insulin amyloid formation (Ohno et al. 2019). Complexation of insulin with zinc and/or protamine can generate intermediate- or long-acting formulations (Waller and Sampson 2017).

Conclusions

Insulin is a scarce commodity, with global demands projected to rise due to increasing prevalence of diabetes. The chronic nature of diabetes means that patients require long-term insulin treatment, and this can place heavy financial burden on the healthcare systems and individuals. It is therefore imperative that all the manufacturers and stakeholders continue to play their parts in making further progresses in therapeutic recombinant insulin development and production. A comprehensive review of recombinant human insulin and its analogues’ downstream processing via the “proinsulin route” from E. coli-based system has been presented. We have provided pertinent examples of critical production stages and discussed the practical aspects of integrating every procedure into a multimodal purification scheme. As evidenced from this review, a large number of purification and conversion steps are required to recover and purify the recombinant insulin product. This review may serve as a guide to help downstream process scientists develop a more efficient and economical process for the production of human insulin and its analogues.

Availability of data and materials

All data generated or analyzed during this study are included in this published article.

Abbreviations

AC:

Affinity chromatography

ACN:

Acetonitrile

AEX:

Anion exchange chromatography

BME:

β-Mercaptoethanol

CEX:

Cation exchange chromatography

CPB:

Carboxypeptidase B

CNBr:

Cyanogen bromide

Cryst.:

Crystallization

CV:

Column volume

DTT:

Dithiothreitol

ELISA:

Enzyme-linked immunosorbent assay

fMet:

N-Formylmethionine

GdnHCl:

Guanidine hydrochloride

GSH:

Reduced glutathione

GSSG:

Oxidized glutathione

HIC:

Hydrophobic interaction chromatography

HPLC:

High-performance liquid chromatography

IB:

Inclusion body

IEX:

Ion-exchange chromatography

IMAC:

Immobilized metal ion affinity chromatography

MMC:

Mixed-mode chromatography

NA:

Information not available

PEG:

Polyethyleneglycol

PFLC:

Protein-folding liquid chromatography

pH-PPT:

pH precipitation

PI:

Proinsulin

pI:

Isoelectric point

rhPI:

Recombinant human proinsulin

RP:

Reversed-phase chromatography

RTP:

Room temperature

SDS-PAGE:

Sodium dodecyl sulfate-polyacrylamide gel electrophoresis

SEC:

Size exclusion chromatography

SMB:

Simulated moving bed

TFA:

Trifluoroacetic acid

ZnCl2-PPT:

Zinc chloride precipitation

References

  1. Astolfi Filho S, De Lima BD, Thiemann JE, de Sousa HR, Vilela L (2004) Vector for expression of heterologous protein and methods for extracting recombinant protein and for purifying isolated recombinant insulin. United States patent 6,699,692, 2 Mar 2004.

  2. Baeshen NA, Baeshen MN, Sheikh A, Bora RS, Ahmed MM, Ramadan HA, Saini KS, Redwan EM (2014) Cell factories for insulin production. Microb Cell Fact 13(1):141

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  3. Bai Q, Kong Y, Geng XD (2003) Studies on renaturation with simultaneous purification of recombinant human proinsulin from E. coli with high performance hydrophobic interaction chromatography. J Liquid Chromatogr Relat Technol 26(5):683–695

    CAS  Article  Google Scholar 

  4. Balcerek J, Sznilik K, Jaros S, Wieczorek M (2018) Method for preparation of a recombinant protein from a precursor. United States patent US 15/532,811, 19 Apr 2018.

  5. Baneyx F (1999) Recombinant protein expression in Escherichia coli. Curr Opin Biotechnol 10(5):411–421

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  6. Batas B, Chaudhuri JB (1996) Protein refolding at high concentration using size-exclusion chromatography. Biotechnol Bioeng 50(1):16–23

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  7. Brange J (1994) Stability of insulin: studies on the physical and chemical stability of insulin in pharmaceutical formulation. Kluwer Academic, Boston

    Google Scholar 

  8. Brange J (2012) Galenics of insulin: the physico-chemical and pharmaceutical aspects of insulin and insulin preparations. Springer, Berlin

    Google Scholar 

  9. Brange J, Langkjœr L (1993) Insulin structure and stability. Stability and characterization of protein and peptide drugs. Springer, Boston, pp 315–350

    Chapter  Google Scholar 

  10. Carr D (2002) The handbook of analysis and purification of peptides and proteins by reversed-phase HPLC. Hesperia: Grace Vydac.

  11. Carr D (2016) A guide to the analysis and purification of proteins and peptides by reversed-phase HPLC. Advanced Chromatography Technologies, New York

    Google Scholar 

  12. Castellanos-Serra LR, Hardy E, Ubieta R, Vispo NS, Fernandez C, Besada V, Falcon V, Gonzalez M, Santos A, Perez G, Silva A (1996) Expression and folding of an interleukin-2-proinsulin fusion protein and its conversion into insulin by a single step enzymatic removal of the C-peptide and the N-terminal fused sequence. FEBS Lett 378(2):171–176

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  13. Chance R, Frank B (1993) Research, development production and safety of biosynthetic human insulin. Diabetes Care 16(3):133–142

    PubMed  Article  PubMed Central  Google Scholar 

  14. Chance RE, Glazer NB, Wishner KL (1999) Insulin lispro (humalog) biopharmaceuticals, an industrial perspective. Springer, Berlin, pp 149–172

    Book  Google Scholar 

  15. Chang SG, Choi KD, Jang SH, Shin HC (2003) Role of disulfide bonds in the structure and activity of human insulin. Mol Cells 16:3

    Google Scholar 

  16. Chatani E, Imamura H, Yamamoto N, Kato M (2014) Stepwise organization of the β-structure identifies key regions essential for the propagation and cytotoxicity of insulin amyloid fibrils. J Biol Chem 289(15):10399–10410

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  17. Chen S, Adijanto L, Wang NH (2010) In vitro folding of methionine-arginine human lyspro-proinsulin S-sulfonate—disulfide formation pathways and factors controlling yield. Biotechnol Prog 26(5):1332–1343

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  18. Chen Y, Wang Q, Zhang C, Li X, Gao Q, Dong C, Liu Y, Su Z (2016) Improving the refolding efficiency for proinsulin aspart inclusion body with optimized buffer compositions. Protein Expr Purif 122:1–7

    PubMed  Article  CAS  PubMed Central  Google Scholar 

  19. Chouhan R, Goswami S, Bajpai AK (2017) Recent advancements in oral delivery of insulin: from challenges to solutions. Nanostructures for oral medicine. Elsevier, New York, pp 435–465

    Google Scholar 

  20. Clark ED (1998) Refolding of recombinant proteins. Curr Opin Biotechnol 9(2):157–163

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  21. Coleman MP, Ortigosa AD, Sleevi MC, Kartoa CH (2019) Process for purifying insulin and analogues thereof. United States patent US 10,421,795, 24 Sep 2019.

  22. Cowley DJ, Mackin RB (1997) Expression, purification and characterization of recombinant human proinsulin. FEBS Lett 402(2–3):124–130

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  23. Cytiva (2021) Sephadex G-25 Medium. https://www.cytivalifesciences.com/en/us/shop/chromatography/resins/size-exclusion/sephadex-g-25-medium-p-05608#tech-spec-table. Accessed 14 Apr 2021.

  24. Dickhardt R, Unger B (1997) Chromatographic process for purification of insulin. United States patent US 5,621,073, 15 Apr 1997.

  25. Dickhardt R, Unger B, Hafner L (1993) Process for the purification of insulins by chromatography. United States patent US 5,245,008, 14 Sep 1993.

  26. Evnin LB, Vásquez JR, Craik CS (1990) Substrate specificity of trypsin investigated by using a genetic selection. Proc Natl Acad Sci 87(17):6659–6663

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  27. Fortune Business Insights (2020) Human insulin market size, share & industry analysis, by type (analogue insulin, traditional human insulin), by diabetes type (type 1, type 2), by distribution channel (retail pharmacy, hospital pharmacy, online pharmacy), and regional forecast, 2019–2026. https://www.fortunebusinessinsights.com/industry-reports/human-insulin-market-100395. Accessed 14 Apr 2021.

  28. Frank BH, Pettee JM, Zimmerman RE, Burck PJ (1981) In: Rich D, Gross E, eds. Peptides: proceedings of the Seventh American Peptide Chemistry Symposium. Berlin: Springer, p 729–739.

  29. Freeman JS (2009) Insulin analog therapy: improving the match with physiologic insulin secretion. J Am Osteopath Assoc 109(1):26–36

    PubMed  Google Scholar 

  30. GE Healthcare, 28-9966-22, Edition AA (2012) Application note: High-throughput screening and process development for capture of recombinant pro-insulin from E. coli.

  31. GE Healthcare 29-0018-56 (2012) Application note: high-throughput screening, process development, and scale-up of an intermediate purification step for recombinant insulin.

  32. Govender K, Naicker T, Baijnath S, Chuturgoon AA, Abdul NS, Docrat T, Kruger HG, Govender T (2020) Sub/supercritical fluid chromatography employing water-rich modifier enables the purification of biosynthesized human insulin. Journal of Chromatography B 1155:122126.

  33. Gráf L, Jancso A, Szilágyi L, Hegyi G, Pintér K, Náray-Szabó G, Hepp J, Medzihradszky K, Rutter WJ (1988) Electrostatic complementarity within the substrate-binding pocket of trypsin. Proc Natl Acad Sci 85(14):4961–4965

    PubMed  PubMed Central  Article  Google Scholar 

  34. Guo ZY, Qiao ZS, Feng YM (2008) The in vitro oxidative folding of the insulin superfamily. Antioxid Redox Signal 10(1):127–140

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  35. Harrison RG, Todd P, Rudge SR, Petrides DP (2015) Bioseparations science and engineering. Oxford University Press, USA

    Book  Google Scholar 

  36. Healthcare GE (2006). Hydrophobic Interaction and Reversed Phase Chromatography Principles and Methods. https://cdn.cytivalifesciences.com/dmm3bwsv3/AssetStream.aspx?mediaformatid=10061&destinationid=10016&assetid=14026. Accessed 14 Apr 2021.

  37. Heldin E, Grönlund S, Shanagar J, Hallgren E, Eriksson K, Xavier M, Tunes H, Vilela L (2014) Development of an intermediate chromatography step in an insulin purification process. The use of a high throughput process development approach based on selectivity parameters. J Chromatogr B 973:126–132

    CAS  Article  Google Scholar 

  38. Helmerhorst E, Stokes GB (1987) Self-association of insulin: Its pH dependence and effect of plasma. Diabetes 36(3):261–264

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  39. Hua QX, Jia W, Frank BH, Phillips NF, Weiss MA (2002) A protein caught in a kinetic trap: structures and stabilities of insulin disulfide isomers. Biochemistry 41(50):14700–14715

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  40. Hwang HG, Kim KJ, Lee SH, Kim CK, Min CK, Yun JM, Lee SU, Son YJ (2016) Recombinant glargine insulin production process using Escherichia coli. J Microbiol Biotechnol 26(10):1781–1789

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  41. Jia XY, Guo ZY, Wang Y, Xu Y, Duan SS, Feng YM (2003) Peptide models of four possible insulin folding intermediates with two disulfides. Protein Sci 12(11):2412–2419

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  42. Johansson K, Frederiksen SS, Degerman M, Breil MP, Mollerup JM, Nilsson B (2015) Combined effects of potassium chloride and ethanol as mobile phase modulators on hydrophobic interaction and reversed-phase chromatography of three insulin variants. J Chromatogr A 1381:64–73

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  43. Kim CK, Lee SB, Son YJ (2015) Large-scale refolding and enzyme reaction of human preproinsulin for production of human insulin. J Microbiol Biotechnol 25(10):1742–1750

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  44. Kossiakoff AA (1988) Tertiary structure is a principal determinant to protein deamidation. Science 240(4849):191–194

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  45. Kroeff EP, Owens RA, Campbell EL, Johnson RD, Marks HI (1989) Production scale purification of biosynthetic human insulin by reversed-phase high-performance liquid chromatography. J Chromatogr A 461:45–61

    CAS  Article  Google Scholar 

  46. Kromasil (2021). Kromasil preparative HPLC applications—success stories. https://www.kromasil.com/notes/?NOTEkpmo. Accessed 14 Apr 2021.

  47. Kyte J, Doolittle RF (1982) A simple method for displaying the hydropathic character of a protein. J Mol Biol 157(1):105–132

    CAS  PubMed  Article  Google Scholar 

  48. Ladisch MR, Kohlmann KL (1992) Recombinant human insulin. Biotechnol Prog 8(6):469–478

    CAS  PubMed  Article  Google Scholar 

  49. Laursen BS, Sørensen HP, Mortensen KK, Sperling-Petersen HU (2005) Initiation of protein synthesis in bacteria. Microbiol Mol Biol Rev 69(1):101–123

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  50. Leng C, Li Q, Wu F, Chen L, Su P (2013) Detection of the single-chain precursor in the production and purification process of recombinant human insulin. Monocl Antibodies Immunodiagn Immunother 32(4):255–261

    CAS  Article  Google Scholar 

  51. Mackin RB (1999) Streamlined procedure for the production of normal and altered versions of recombinant human proinsulin. Protein Expr Purif 15(3):308–313

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  52. Mackin RB (2014) Alternative preparation of inclusion bodies excludes interfering non-protein contaminants and improves the yield of recombinant proinsulin. MethodsX 1:108–117

    PubMed  PubMed Central  Article  Google Scholar 

  53. Mackin RB, Choquette MH (2003) Expression, purification, and PC1-mediated processing of (H10D, P28K, and K29P)-human proinsulin. Protein Expr Purif 27(2):210–219

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  54. Mergulhao FJ, Monteiro GA, Cabral JM, Taipa MA (2001) A quantitative ELISA for monitoring the secretion of ZZ-fusion proteins using SpA domain as immunodetection reporter system. Mol Biotechnol 19(3):239–244

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  55. Mergulhao FJ, Taipa MA, Cabral JM, Monteiro GA (2004) Evaluation of bottlenecks in proinsulin secretion by Escherichia coli. J Biotechnol 109(1–2):31–43

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  56. Merck Millipore (2021) Dialysis or Diafiltration? https://www.drogallega.es/u/uploads/File/1335219226_DialysisorDiafiltration.MM2.pdf. Accessed 14 Apr 2021.

  57. Mikiewicz D, Bierczyńska-Krzysik A, Sobolewska A, Stadnik D, Bogiel M, Pawłowska M, Wójtowicz-Krawiec A, Baran PA, Łukasiewicz N, Romanik-Chruścielewska A, Sokołowska I (2017) Soluble insulin analogs combining rapid-and long-acting hypoglycemic properties–from an efficient E. coli expression system to a pharmaceutical formulation. PLoS ONE 12(3):e0172600

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  58. Min CK, Son YJ, Kim CK, Park SJ, Lee JW (2011) Increased expression, folding and enzyme reaction rate of recombinant human insulin by selecting appropriate leader peptide. J Biotechnol 151(4):350–356

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  59. Mollerup I, Jensen SW, Larsen P, Schou O, Snel L (2010) Insulin purification. Encyclop Ind Biotechnol 4:1–20

    Google Scholar 

  60. Mollerup JM, Frederiksen SS (2016) Purification of insulin. United States patent US 9,447,163, 20 Sep 2016.

  61. Muzaffar M, Ahmad A (2011) The mechanism of enhanced insulin amyloid fibril formation by NaCl is better explained by a conformational change model. PLoS ONE 6(11):e27906

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  62. Nielsen L, Khurana R, Coats A, Frokjaer S, Brange J, Vyas S, Uversky VN, Fink AL (2001) Effect of environmental factors on the kinetics of insulin fibril formation: elucidation of the molecular mechanism. Biochemistry 40(20):6036–6046

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  63. Nilsson J, Jonasson P, Samuelsson E, Stahl S, Uhlén M (1996) Integrated production of human insulin and its C-peptide. J Biotechnol 48(3):241–250

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  64. Ohno Y, Seki T, Kojima Y, Miki R, Egawa Y, Hosoya O, Kasono K, Seki T (2019) Investigation of factors that cause insulin precipitation and/or amyloid formation in insulin formulations. J Pharm Health Care Sci 5(1):1–1

    Article  Google Scholar 

  65. Palmer I, Wingfield PT (2012) Preparation and extraction of insoluble (inclusion-body) proteins from Escherichia coli, Chap 6 (Unit 6.3). In: Current protocols in protein science

  66. Patrick JS, Lagu AL (1992) Determination of recombinant human proinsulin fusion protein produced in Escherichia coli using oxidative sulfitolysis and two-dimensional HPLC. Anal Chem 64(5):507–511

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  67. Petrides D, Sapidou E, Calandranis J (1995) Computer-aided process analysis and economic evaluation for biosynthetic human insulin production—A case study. Biotechnol Bioeng 48(5):529–541

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  68. Pharmacia Biotech (2021) Sephadex® G-25 media and pre-packed columns. http://www.chembio.uoguelph.ca/educmat/chm357/g25.pdf. Accessed 14 Apr 2021

  69. Qiao ZS, Min CY, Hua QX, Weiss MA, Feng YM (2003) In vitro refolding of human proinsulin: kinetic intermediates, putative disulfide-forming pathway, folding initiation site, and potential role of c-peptide in folding process. J Biol Chem 278(20):17800–17809

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  70. Qiao ZS, Guo ZY, Feng YM (2006) In vitro folding/unfolding of insulin/single-chain insulin. Protein & Peptide Lett 13(5):423–429

  71. Redwan ER, Matar SM, El-Aziz GA, Serour EA (2007) Synthesis of the human insulin gene: protein expression, scaling up and bioactivity. Prep Biochem Biotechnol 38(1):24–39

    Article  CAS  Google Scholar 

  72. Ritchie C (2012) Protein purification. Mater Methods 2:134. https://doi.org/10.13070/mm.en.2.134.

  73. Schoner RG, Ellis LF, Schoner BE (1985) Isolation and purification of protein granules from Escherichia coli cells overproducing bovine growth hormone. Bio/technology 3(2):151–154

    CAS  Article  Google Scholar 

  74. Sigma (2021) INSULIN, HUMAN, RECOMBINANT EXPRESSED IN E. coli (Product information sheet). https://www.sigmaaldrich.com/content/dam/sigma-aldrich/docs/Sigma/Product_Information_Sheet/2/i2767pis.pdf. Accessed 14 Apr 2021.

  75. Singh A, Upadhyay V, Upadhyay AK, Singh SM, Panda AK (2015) Protein recovery from inclusion bodies of Escherichia coli using mild solubilization process. Microb Cell Fact 14(1):41

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  76. Singhvi P, Saneja A, Srichandan S, Panda AK (2020) Bacterial inclusion bodies: a treasure trove of bioactive proteins. Trends Biotechnol 38(5):474–486

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  77. Son YJ, Kim CK, Choi BT, Park YC, Seo JH (2008) Effects of β-mercaptoethanol and hydrogen peroxide on enzymatic conversion of human proinsulin to insulin. J Microbiol Biotechnol 18(5):983–989

    CAS  PubMed  Google Scholar 

  78. Son YJ, Kim CK, Kim YB, Kweon DH, Park YC, Seo JH (2009) Effects of citraconylation on enzymatic modification of human proinsulin using trypsin and carboxypeptidase B. Biotechnol Prog 25(4):1064–1070

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  79. Sung WL, Yao FL, Zahab DM, Narang SA (1986) Short synthetic oligodeoxyribonucleotide leader sequences enhance accumulation of human proinsulin synthesized in Escherichia coli. Proc Natl Acad Sci 83(3):561–565

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  80. Thurow H, Blumenstock H, Havenith C (2010) Method of purifying preproinsulin. United States patent US 7,803,763, 28 Sep 2010.

  81. Tiiman A, Noormägi A, Friedemann M, Krishtal J, Palumaa P, Tõugu V (2013) Effect of agitation on the peptide fibrillization: Alzheimer’s amyloid-β peptide 1–42 but not amylin and insulin fibrils can grow under quiescent conditions. J Pept Sci 19(6):386–391

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  82. Tikhonov RV, Pechenov SE, Belacheu IA, Yakimov SA, Klyushnichenko VE, Boldireva EF, Korobko VG, Tunes H, Thiemann JE, Vilela L, Wulfson AN (2001) Recombinant Human Insulin: VIII. Isolation of Fusion Protein–S-Sulfonate, Biotechnological Precursor of Human Insulin, from the Biomass of Transformed Escherichia coli Cells. Protein expression and purification 21(1):176–82.

  83. Verma S (2021). Reversed-phase HPLC Buffers. Sigma-aldrich. Reporter US Volume 27.4. https://www.sigmaaldrich.com/technical-documents/articles/reporter-us/reversed-phase-hplc.html#:~:text=The%20typical%20pH%20range%20for,is%20pH%202%20to%208. Accessed 14 Apr 2021.

  84. Waller DG, Sampson T (2017) Medical pharmacology and therapeutics E-Book. Elsevier, New York

    Google Scholar 

  85. Walsh G (2005) Therapeutic insulins and their large-scale manufacture. Appl Microbiol Biotechnol 67(2):151–159

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  86. Wang NH, Xie Y, Mun S (2003) Insulin purification using simulated moving bed technology. United States patent US 10/276,948, 20 Nov 2003.

  87. Watson DS, Ortigosa AD, Rauscher MA, Story KM (2018) Chromatography process for purification of insulin and insulin analogs. United States patent US 10,155,799, 18 Dec 2018.

  88. Whittingham JL, Scott DJ, Chance K, Wilson A, Finch J, Brange J, Dodson GG (2002) Insulin at pH 2: structural analysis of the conditions promoting insulin fibre formation. J Mol Biol 318(2):479–490

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  89. Winter J, Lilie H, Rudolph R (2002a) Recombinant expression and in vitro folding of proinsulin are stimulated by the synthetic dithiol Vectrase-P. FEMS Microbiol Lett 213(2):225–230

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  90. Winter J, Lilie H, Rudolph R (2002b) Renaturation of human proinsulin—a study on refolding and conversion to insulin. Anal Biochem 310(2):148–155

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  91. Wintersteiner O, Abramson HA (1933) The isoelectric point of insulin electrical properties of adsorbed and crystalline insulin. J Biol Chem 99(3):741–753

    CAS  Article  Google Scholar 

  92. World Health Organization, WHO (2021) The WHO Global Diabetes Compact. https://www.who.int/docs/default-source/world-diabetes-day/global-diabetes-compact-final.pdf. Accessed 16 Apr 2021.

  93. Yuan J, Zhou H, Yang Y, Li W, Wan Y, Wang L (2015) Refolding and simultaneous purification of recombinant human proinsulin from inclusion bodies on protein-folding liquid-chromatography columns. Biomed Chromatogr 29(5):777–782

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  94. Zhang K, Liu X (2016) Mixed-mode chromatography in pharmaceutical and biopharmaceutical applications. J Pharm Biomed Anal 128:73–88

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  95. Zieliński M, Romanik-Chruścielewska A, Mikiewicz D, Łukasiewicz N, Sokołowska I, Antosik J, Sobolewska-Ruta A, Bierczyńska-Krzysik A, Zaleski P, Płucienniczak A (2019) Expression and purification of recombinant human insulin from E. coli 20 strain. Protein Expr Purif 157:63–69

    PubMed  Article  CAS  PubMed Central  Google Scholar 

  96. Zimmerman RE, Stokell DJ (2010) Insulin production methods and pro-insulin constructs. United States patent US 7

Download references

Acknowledgements

This research is supported by the Agency for Science, Technology and Research (A*STAR), Singapore.

Funding

This research is supported by the Agency for Science, Technology and Research (A*STAR), Singapore.

Author information

Affiliations

Authors

Contributions

YYS conceived and prepared the first draft, and WZ made revision. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Wei Zhang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Siew, Y.Y., Zhang, W. Downstream processing of recombinant human insulin and its analogues production from E. coli inclusion bodies. Bioresour. Bioprocess. 8, 65 (2021). https://doi.org/10.1186/s40643-021-00419-w

Download citation

Keywords

  • Recombinant human insulin
  • Insulin analogues
  • E. coli inclusion bodies
  • Downstream processing
  • Purification