파이썬으로 분산분석(ANOVA)

출처 : Thomas Haslwanter, CMPT 318

ANOVA (ANalysis Of VAriance; 분산 분석)은 여러 그룹간의 평균의 차이가 통계적으로 유의미 한지를 판단하기 위한 시험법입니다. T-test 처럼 말이죠. 하지만 그룹의 수가 2개 이상일때 사용한다는 점이 다른점이죠.

분산분석에는 다음과 같은 3가지의 조건이 필요해요.

  • 정규성 : 각각의 그룹에서 변인은 정규분포.
  • 분산의 동질성 : Y의 모집단 분산은 각각의 모집단에서 동일.
  • 관찰의 독립성: 각각의 모집단에서 크기가 각각인 표본들이 독립적으로 표집.

파이썬에서 분산분석을 사용하기 위한 라이브러리를 불러오는 코드부터 실행하겠습니다.

In [1]:
import scipy.stats as stats
import pandas as pd
import urllib
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm
import matplotlib.pyplot as plt
import numpy as np

%matplotlib inline

분산(variance)의 중요함

아래의 두개의 그림을 통해 분산에 대해 알아 보겠습니다. 평균값은 동일하지만 분산 값이 다름으로 인해 전체적인 데이터의 모습이 완전히 달라 보이는것을 알 수 있죠.

In [2]:
centers = [5, 5.3, 4.5]
std = 0.1
colors = "brg"

data_1 = []
for i in range(3):
    data_1.append(stats.norm(centers[i], std).rvs(100))
    plt.plot(
        np.arange(len(data_1[i])) + i * len(data_1[0]), data_1[i], ".", color=colors[i]
    )
No description has been provided for this image
In [3]:
std_2 = 2
data_2 = []
for i in range(3):
    data_2.append(stats.norm(centers[i], std_2).rvs(100))
    plt.plot(
        np.arange(len(data_1[i])) + i * len(data_2[0]), data_2[i], ".", color=colors[i]
    )
No description has been provided for this image

분산이 클수록 집단의 평균값의 차이가 무의미해집니다.

이것을 통해 두가지를 알 수 있는데요. 1. 집단 평균값 의 분산이 클수록 그리고, 2. 집단 내 분산이 작아질수록 평균의 차이가 분명해집니다.

집단 간 분산집단 내 분산 이 두가지를 이용해 분석을 하기에 분산분석이라고 합니다.

분산분석에는 여러 종류가 있지만, 여기서는 일원분산분석(One-way ANOVA)이원분산분석(Two-way ANOVA)를 알아보겠습니다.

일원분산분석(One-way ANOVA)

종속변인은 1개이며, 독립변인의 집단도 1개인 경우입니다. 한가지 변수의 변화가 결과 변수에 미치는 영향을 보기 위해 사용됩니다.

파이썬에서 One-way ANOVA 분석은 scipy.stats이나 statsmodel 라이브러리를 이용해서 할 수 있습니다.

statsmodel 라이브러리가 좀 더 많고 규격화된 정보를 제공합니다.

예시 데이터(Altman 910) 설명

22명의 심장 우회 수술을 받은 환자를 다음의 3가지 그룹으로 나누었습니다.

  • Group I: 50% 아산화 질소(nitrous oxide)와 50%의 산소(oxygen) 혼합물을 24시간 동안 흡입한 환자
  • Group II: 50% 아산화 질소와 50% 산소 혼합물을 수술 받는 동안만 흡입한 환자
  • Group III: 아산화 질소 없이 오직 35-50%의 산소만 24시간동안 처리한 환자

그런 다음 적혈구의 엽산 수치를 24시간 이후에 측정하였습니다.

In [4]:
# url로 데이터 얻어오기
url = "https://raw.githubusercontent.com/thomas-haslwanter/statsintro_python/master/ipynb/Data/data_altman/altman_910.txt"
data = np.genfromtxt(urllib.request.urlopen(url), delimiter=",")

# Sort them into groups, according to column 1
group1 = data[data[:, 1] == 1, 0]
group2 = data[data[:, 1] == 2, 0]
group3 = data[data[:, 1] == 3, 0]

# matplotlib plotting
plot_data = [group1, group2, group3]
ax = plt.boxplot(plot_data)
plt.show()
No description has been provided for this image

Boxplot에서 볼 수 있듯이, 평균값의 차이가 실제로 의미가 있는 차이인지, 분산이 커서 그런것인지 애매한 상황입니다. 이런 상황에서 분산분석을 통해 통계적 유의성을 알아 볼 수 있습니다.

Scipy.stats으로 일원분산분석

아래와 같은 간단한 코드로 분산분석을 할 수 있습니다.

In [5]:
F_statistic, pVal = stats.f_oneway(group1, group2, group3)

print(
    "Altman 910 데이터의 일원분산분석 결과 : F={0:.1f}, p={1:.5f}".format(
        F_statistic, pVal
    )
)
if pVal < 0.05:
    print(
        "P-value 값이 충분히 작음으로 인해 그룹의 평균값이 통계적으로 유의미하게 차이납니다."
    )
Altman 910 데이터의 일원분산분석 결과 : F=3.7, p=0.04359
P-value 값이 충분히 작음으로 인해 그룹의 평균값이 통계적으로 유의미하게 차이납니다.

이번에는 좀 더 우아한 방법으로 pandasstatsmodels 라이브러리를 사용해서 분산분석을 해보겠습니다.

Statsmodel을 사용한 일원분산분석

In [6]:
# 경고 메세지 무시하기
import warnings

warnings.filterwarnings("ignore")

df = pd.DataFrame(data, columns=["value", "treatment"])

# the "C" indicates categorical data
model = ols("value ~ C(treatment)", df).fit()

print(anova_lm(model))
                df        sum_sq      mean_sq         F    PR(>F)
C(treatment)   2.0  15515.766414  7757.883207  3.711336  0.043589
Residual      19.0  39716.097222  2090.320906       NaN       NaN

보시다 싶이 statsmodels 을 사용하면 보다 간편하면서도 깔끔한 결과를 얻을 수 있습니다.

이원분산분석(two-way ANOVA)

독립변인의 수가 두 개 이상일 때 집단 간 차이가 유의한지를 검증하는 데 사용합니다. 상호작용효과(Interaction effect), 한 변수의 변화가 결과에 미치는 영향이 다른 변수의 수준에 따라 달라지는지를 확인하기 위해 사용됩니다.

예제 데이터(altman_12_6) 설명

태아의 머리 둘레 측정 데이터입니다. 4명의 관측자가 3명의 태아를 대상으로 측정을 했습니다. 이를 통해서 초음파로 태아의 머리 둘레측정 데이터가 재현성이 있는지를 조사하였습니다.

In [7]:
inFile = "altman_12_6.txt"
url_base = "https://raw.githubusercontent.com/thomas-haslwanter/statsintro_python/master/ipynb/Data/data_altman/"
url = url_base + inFile
data = np.genfromtxt(urllib.request.urlopen(url), delimiter=",")

# Bring them in dataframe-format
df = pd.DataFrame(data, columns=["head_size", "fetus", "observer"])
# df.tail()

# 태아별 머리 둘레 plot 만들기
df.boxplot(column="head_size", by="fetus", grid=False)
Out[7]:
<matplotlib.axes._subplots.AxesSubplot at 0xb21b748>
No description has been provided for this image

그림을 보니 태아(fetus) 3명의 머리 둘레는 차이가 있어보입니다. 그런데 이것이 관측자와 상호작용이 있는것인지 분석을 통해 알아 보겠습니다.

분산분석으로 상관관계

statmodels 라이브러리를 사용해 간단하게 계산할 수 있습니다.

In [9]:
formula = "head_size ~ C(fetus) + C(observer) + C(fetus):C(observer)"
lm = ols(formula, df).fit()
print(anova_lm(lm))
                        df      sum_sq     mean_sq            F        PR(>F)
C(fetus)               2.0  324.008889  162.004444  2113.101449  1.051039e-27
C(observer)            3.0    1.198611    0.399537     5.211353  6.497055e-03
C(fetus):C(observer)   6.0    0.562222    0.093704     1.222222  3.295509e-01
Residual              24.0    1.840000    0.076667          NaN           NaN

P-value 가 0.05 이상 입니다, 따라서 귀무가설을 기각할 수 없고. 측정자와 태아의 머리둘레값에는 연관성이 없다고 할 수 있습니다. 측정하는 사람이 달라도 머리 둘레값은 일정하는 것이죠.

결론적으로 초음파로 측정하는 태아의 머리둘레값은 믿을 수 있다는 것입니다.

마치며

분산분석(ANOVA)는 전체 그룹간의 평균값 차이가 통계적 의미가 있는지 판단하는데 유용한 도구 입니다. 하지만 정확히 어느 그룹의 평균값이 의미가 있는지는 알려주지 않습니다. 따라서 추가적인 사후분석(Post Hoc Analysis) 이 필요합니다. 사후분석에 대해서는 다음 기회에 알아 보도록 하겠습니다.

Lab_materials

실험하면서 자주 사용되는 시약의 정보를 모아두겠습니다.


Table of contents

  1. Media
  2. Antibiotics
  3. Enzymes
  4. Buffers
  5. Other common chemicals

This category from OpenWetWare


# 1. Media --------

1.1. Defined Media

Synthetic Complete (SC) Medium aka (CSM, CM). Defined yeast medium

  • Adenine sulfate 20 mg/L
  • Uracil 20 mg/L
  • L-Tryptophan 20 mg/L
  • L-Histidine-HC1 20 mg/L
  • L-Arginine-HC1 20 mg/L
  • L-Methionine 20 mg/L
  • L-Tyrosine 30 mg/L
  • L-Leucine 60 mg/L
  • L-Isoleucine 30 mg/L
  • L-Lysine-HC1 30 mg/L
  • L-Phenylalanine 50 mg/L
  • L-Glutamic acid 100 mg/L
  • L-Aspartic acid 100 mg/L
  • L-Valine 150 mg/L
  • L-Threonine 200 mg/L
  • L-Serine 400 mg/L

1.2. Standard Media

Luria-Bertani Medium (aka L-Broth or LB Medium)

LB is a standard growth medium for a variety of bacteria and conditions.

Ingredients for 1L
  • 10 g Bacto-tryptone
  • 5 g yeast extract
  • 10 g NaCl

For plates also add

  • 15g agar

Note: There are two formulations of LB, Miller and Lennox, that differ in the amount of NaCl. Lennox has less salt, only 5 g/L. The Qiagen miniprep kit recommends LB with 10 g NaCl for highest plasmid yields.

Protocol
  1. Mix dry ingredients and add distilled water up to 1 Liter
  2. Pour into 2 L flask (or greater)
  3. Autoclave (liquid cycle) 250°F, 22psi, 30 minutes

Notes: We do not pH medium when we make it on the fly. However, if it is really important, pH the medium to 7.0 with 5M NaOH (~200µL). Check with pH paper

For plates
  1. Mix dry ingredients and add distilled water up to less than 1 Liter
  2. After dissolved, add agar
  3. Pour into 2 L flask (or greater)
  4. Autoclave (liquid cycle): 250°F, 22psi, 30 minutes
  5. Cool media down to 60-55C (uncomfortable but not painful)
  6. Add appropriate antibiotic
  7. Pour the solution on sterile plates
  8. Let solidify about 1 h, wrap, label (name, date, additive) keep at 4°C

# 2. Antibiotics

Kanamycin

Mode of Action

Bacteriocidal. Diffuses through the porinchannels in the outer membrane of gram-negative bacteria. Interacts with at least three ribosomal proteins, inhibiting protein synthesis and increasing translation errors.

Working Concentration and Stock Solution

Working concentration is 50 μg/ml, or 25 μg/ml for low-copy plasmids. Stock solution is 35 mg/ml in water (kanamycin is insoluble in 50% alcohol).

Ampicillin

Mode of Action

Inhibits the formation of cross-links in the peptidoglycan layer (which provides rigidity to the cell wall). Most effective against cells in log phase growth (since this is when new cross-links are being formed), and has little effect on cells in stationary phase.

Stock Solution

Typical concentrations of ampicillin are 50 μg/mL for low copy plasmids and 100 μg/mL for high copy plasmids. Stock solutions are typically at 100 mg/ml, so that 1 ml of antibiotic can be added to 1 liter of broth or agar. Stock solutions made in 50% alcohol remain liquid at -20°C and are easy to pipet. Cool agar to 55°C or below prior to adding antibiotic.

Ampicillin available from Sigma A-9518 (Ampicillin sodium salt), FW 371.39. To make 100ml of 100 mg/ml stock solution, dissolve 10 g of ampicillin in 50 ml of water and 50 ml of 100% ethanol.

Stability

Culture plates with ampicillin can be stored at 2-8°C for up to two weeks. (In our experience, plates are usable for 2-3 months when stored at 4°C and bagged to prevent evaporation.)
Stock solutions may be stored at 2-8°C for up to 3 weeks. For long term storage (4-6 months), stock solutions should be stored at -20°C. At 37°C in culture, ampicillin is stable up to 3 days. Sigma reference

Carbenicillin

Carbenicillin is used to select for cells bearing the beta-lactamase gene bla or ampR. Beta-lactamase also neutralizes ampicillin, but due to ampicillin's lower effectiveness, carbenicillin tends to give fewer satellite colonies and makes for more reliable cloning. Ampicillin is more widely used, though, due to carbenicillin's high cost ($75/g compared to ampicillin's $10/g (Sigma)).

Tetracycline

Tetracyclines (there are many) enter the cell by diffusion through OmpF porins. It inhibits growth by preventing codon-anticodon interactions during translation.

Working concentration and Stock Solutions

Using a plasmid with copy number 15-20, 20μg/ml tet has proved a successful selector. The reduction in growth rate, described above was significant. The RajBhandary lab uses 5μg/ml and see a minimal reduction in growth rate and never lose plasmid from their cells. Stock solution is 15 mg/ml in 50% ethanol (note: using 95-100% ethanol is much better for dissolving stock powder). Add 1 ml per liter of cooled broth or agar. Solutions and plates are light sensitive. Tetracycline is incompatible with (chelated by) magnesium ions, and thus unsuitable for use in minimal media

Streptomycin

Mode of Action

Inhibits protein synthesis; binds to 30S ribosomal subunit. The rpsL50 gene mutation conveys streptomycin resistance. Resistance is recessive, so the presence of a wild type rpsL gene will convey sensitivity. This allows counter-selection against the presence of the wt rpsL gene. Mutations of wild type genes to convey resistance are common.

Working concentration and Stock Solutions

100 mg/mL stock solution in water. It is barely soluble in ethanol so you must dissolve it in water. Add 1 mL per liter of culture.

Chloramphenicol

Chloramphenicol is a bacteriostatic agent that binds to the 50S ribosomal subunit and inhibits ribosomal peptide bond formation. It is sometimes used as a way of "amplifying" plasmid production by shutting down protein synthesis in cultures, while allowing plasmid replication to continue.

Working Concentrations and Stock Solutions

High copy plasmids allow 35 μg/ml concentration. Low-copy plasmids like bacterial artificial chromosomes allow 12.5 μg/ml. Stock solutions can be made at 35 mg/ml in ethanol, kept at -20C.

Neomycin

binds to ribosomal components and inhibits protein synthesis. target organism: bacteria, mammalian cells

Working Concentrations and Stock Solutions

stock solution: 10mg/ml in water working solution: 10-50µg/ml (stable at RT for 5days)


3. Enzymes


3.1 Phosphatases

Antarctic Phosphatase

This enzyme is preferable to either Calf Intestinal Phosphatase or Shrimp Alkaline Phosphatase because it is more easily heat-inactivated. After phosphatase treatment of a vector, any remaining enzymatic activity would remove 5' phosphates from the insert in a subsequent ligation reaction, preventing a successful reaction. Thus, it is critical that the phosphatase be inactivated, purified away, or digested with Proteinase K.

3.2. DNA polymerases

Pfu

Pfu has proof reading activity which makes it more accurate than for example Taq. Pfu does not exhibit 5' -> 3' exonuclease activity, which prevents degradation of the primers in mutagenesis methods based on ligation of primers or elongated primers. Pfu is slower than Taq and one should calculate 2 min per 1000 bp (1 min for Taq)

T4 DNA Polymerase

LIC quality T4 DNA Polymerase can be used to "chewback" 3' ends for use in Ligation Independent Cloning (LIC).

3.3. DNA ligases

DNA ligase is a specific type of enzyme, a ligase, that facilitates the joining of DNA strands together by catalyzing the formation of a phosphodiester bond.

3.4. RNA Polymerases

3.5. Restriction enzymes

Restriction enzymes are used for performing a restriction digest.

Comprehensive information can be found in REBASE. Neeraj Dhar's good cutter list is a list of enzymes which are particularly reliable and straightforwards.

This is another source of info on restricition enzymes. It's by the same people who provide REBASE, but in a different format:

3.6. Other

Lysozyme

Breaks down cell walls by catalyzing the hydrolysis of cell wall peptidoglycans.


4. Buffers


TAE

TAE is a commonly used buffer for making and running DNA agarose gels. It offers a few advantages and disadvantages compared to TBE buffer:

  • TAE buffer provides optimal resolution of fragments >4 kb in length, while TBE provides better resolution for 0.1 to 3 kb fragments.
  • TAE is significantly cheaper to make
  • TAE stocks can be 50X concentrated and therefore take up less space than 10X concentrated TBE stock
  • TBE offers a higher resolution and has a higher buffering capacity at greater temperatures induced by relatively higher voltages
  • TBE can negatively influence the yield of DNA after recovery from a gel when using glass-based protocols. Yield does not seem to be influenced when using silica-based methods.
  • The bromide ion in TBE is a powerful inhibitor of enzyme activity. This can prevent degradation of nucleic acid, but can also interfere with subsequent experiments like cloning DNA into a vector.

Ingredients for one litre 50X stock

  • Tris-base: 242 g
  • Acetate (100% acetic acid): 57.1 ml
  • EDTA: 100 ml 0.5M sodium EDTA

Add dH2O up to one litre.

To make 1x TAE from 50X TAE stock, dilute 20ml of stock into 980 ml of deionised water.

PBS

PBS stands for phosphate buffered saline and is often used because it is isotonic to cells.

Recipe

1x PBS is 137 mM NaCl, 12 mM Phosphate, 2.7 mM KCl, pH 7.4
To make 10x PBS,

  1. Combine the following:
    • 80 g NaCl
    • 2 g KCl
    • 14.4 g Na2HPO4 (dibasic anhydrous) OR 18.1 g Na2HPO4 · 2H2O (dibasic dihydrate) OR 27.2 g Na2HPO4 · 7H2O (dibasic heptahydrate)
    • 2.4 g KH2PO4 (monobasic anhydrous)
    • 800 mL distilled H2O
  2. Adjust pH to 7.4 with HCl
  3. Add H2O to 1L
  4. Autoclave for 20 minutes on liquid cycle. Store at RT.

PBST

For some applications (washing) one adds Tween 20 PBS which makes PBST

  • Usual concentrations of Tween 20 in PBST are 0.1 or 0.05%.
  • Usual concentration of Triton X-100 in TBST is 0.1-0.3%.

TE

TB buffer is used to make chemically competent cells by the Inoue method Preparing chemically competent cells (Inoue)

Preparation: For a 1 liter solution add

  • 250 mM potassium chloride (18.65 g)
  • 15 mM calcium chloride (2.2 g)
  • 10 mM PIPES (20 ml of an 0.5 M solution)

to 800 ml of DI water. Adjust pH to 6.7 with 1M KOH.

Dissolve

  • 55 mM manganese chloride (10.88 g)

in 100 ml of DI water, add gradually to the solution of the remaining components. Bring solution to 1 liter. The pH of the solution will fall, which is expected.

Do not attempt to adjust the pH. Adding base after adding manganese will precipitate a yellow/brown hydroxide.

Sterile filter with a pre-washed 0.22 μm filter and store indefinitely at 4 degrees.

NEB buffers

  • NEBuffer 1 (yellow)

    • 10 mM Bis Tris Propane-HCl, 10 mM MgCl2, 1 mM DTT (pH 7.0 at 25°C).
  • NEBuffer 2 (blue)

    • 10 mM Tris-HCl, 10 mM MgCl2, 50 mM NaCl, 1 mM DTT (pH 7.9 at 25°C).
  • NEBuffer 3 (red)

    • 50 mM Tris-HCl, 10 mM MgCl2, 100 mM NaCl, 1 mM DTT (pH 7.9 at 25°C).
  • NEBuffer 4 (green)

    • 20 mM Tris-acetate, 10 mM magnesium acetate, 50 mM potassium acetate, 1 mM DTT (pH 7.9 at 25°C).
  • T4 DNA Ligase Buffer

    • 50 mM Tris-HCl, 10 mM MgCl2, 1 mM ATP, 10 mM Dithiothreitol (pH 7.5 at 25°C).
  • NEBuffer EcoRI

    • 100 mM Tris-HCl, 50 mM NaCl, 10 mM MgCl2, 0.025 % Triton X-100 (pH 7.5 at 25°C).

SDS sample buffer

For preparation and loading of protein samples onto a gel for SDS-PAGE analysis (Western blot/protein blot).

Preparation:

SDS contained in the sample buffer makes proteins negatively charged proportionally to their length. 2-mercapto-ethanol/DTT breaks disulphide bonds.

Loading:

glycerol makes the sample buffer more dense than the surrounding running buffer of the protein gel, enabling easy loading into the gel pockets.

To make 10 mL of 4x stock

  • 2.0 ml 1M Tris-HCl pH 6.8
  • 0.8 g SDS
  • 4.0 ml 100% glycerol
  • 0.4 ml 14.7 M β-mercaptoethanol
  • 1.0 ml 0.5 M EDTA
  • 8 mg bromophenol Blue

Final concentrations (1x)

  • 50 mM Tris-HCl pH 6.8
  • 2% SDS
  • 10% glycerol
  • 1% β-mercaptoethanol
  • 12.5 mM EDTA
  • 0.02 % bromophenol blue

TBS

TBS stands for tris-buffered saline and is used in many protocols for washing Endconzentration in 1x TBS will be 20mM Tris and 150mM NaCl

5xTBS for 1 liter with stock solutions:

  • 100 ml 1M Tris-HCl pH 7.5
  • 150 ml 5M NaCl
  • 750 ml ddH2O

Prepare and sterilize filter. Store at 4°C. After diluting the 5x to 1x filter again

TBST

TBST: For some applications one adds Tween 20 or Triton X-100 to TBS which makes TBST (people use the name TBST for both, probably it's lab dependent.

  • Usual concentrations of Tween 20 in TBST are 0.1 or 0.05%.
  • Usual concentration of Triton X-100 in TBST is 0.1%.

Killing Buffer

Killing Buffer is used to kill bacterial cells. Usually one would use cold Killing buffer in a 50:50 ratio to harvest cells in liquid medium. The buffer contains NaN3 and kills the cells quickly. Fast killing and down cooling is important for example to stop RNases if you want to isolate RNA from that specific time point of cell growth.

Endconzentration will be 20mM Tris and 5mM MgCl2 and 20mM NaN3

Killing Buffer for 1 liter:

  • 2 ml 1M Tris-HCl pH 7.5
  • 0.5 ml 1M MgCl2
  • 1.3 g NaN3
  • 997.5 ml ddH2O

Store at 4°C.

Note that NaN3 is highly toxic and should be handled accordingly.


5. Other common chemicals


Ethidium bromide

Ethidium Bromide, often abbreviated EtBr. Intercalating agent that inserts itself between DNA base pairs Fluorescent under UV light

Caution: Carcinogen/Mutagen

IPTG

Isopropyl-beta-D-thiogalactopyranoside Induces transcription from promoters regulated by LacI repressor. Molecular weight is 238.31 g/mol. The chemical formula is here.

  1. Dissolve 1g in 4196 μL deionized water to make 1M solution.
  2. Filter sterilize with syringe and 0.22μm filter

Alternatively

  1. Dissolve 238 mg IPTG (sigma I-6758) in 10 ml DW for (100mM stock)
  2. store in 1 ml aliquotes at -20°C

Generally a 1mM solution is an effective amount to induce the pLac promoter region.

EDTA

EDTA stands for ethylene-diamine-tetraacetic acid. It chelates divalent cations and is therefore used in many buffers. Its relative EGTA has a higher affinity for calcium than for magnesium ions. edta

0.5 M EDTA stock

  • 18.61 g EDTA (Sodium Salt)
  • dH2O to 90 ml
  • adjust pH to 7.0
  • adjust volume to 100 ml

0.5M 500ml pH 8.0 with NaOH pre-calculated

  • 93.05g of Na2.EDTA (FW 372.2)
  • 10.14g of NaOH (FW 40)
  • 500 ml dH2O

Phenol

Phenol is an important chemical in biological research. It is used primarily for isolation and purification of DNA and RNA. Pure phenol is solid at room temperature. Almost all biochemical uses of phenol use a water saturated phenol solution, or a mixture of phenol in chloroform.

Safety

Phenol is caustic and causes nasty chemical burns which are slow to heal. It is also a systemic poison which can be rapidly absorbed through the skin.

Glucose

The molecular weight of anhydrous glucose is 180.16 g/mol. To make 1M glucose, dissolve 18g in 100mL H2O. 1M glucose = 18% glucose

PMSF

PMSF stands for Phenylmethylsulfonyl fluoride. It is used as an irreversible inhibitor of serine protease activity for example in protein purification or co-immunoprecipitation. It has no effect on other proteases that might be present. It can also inactivate some non-proteolytic enzymes that contain serine in the active site.

Use

PMSF is not soluble in water, dissolve in isopropanol, ethanol or methanol. 200 mM stock solution (0.035 g/mL) is stable for months at 4°C, although storage at -20 is probably better. A common final concentration of PMSF is 1mM. PMSF is extremely toxic, and preparation of solutions should be done very carefully. PMSF is not stable in alkaline conditions, or in water. For disposal, add alkali in water and wait to dispose.

Safety

PMSF is extremely poisonous and should be handled carefully.

Glycerol

Glycerol is an alcohol often used in the lab to avoid freezing and increase the density of liquids. Its standardised IUPAC name is propane-1,2,3-triol. It is also often referred to as glycerine or glycerin.

Density

Glycerol is used in many loading buffers to increase the density of the solution which enables good loading into already liquid filled wells of agarose and polyacrylamide gels. See agarose gel loading buffer and SDS sample buffer.

Freezing

Glycerol is also used to decrease the freezing point of protein solution which are easily degraded by repeated freeze/thaw cycles. A 50% (w/v) glycerol/water solution freezes at -25ºC and can thus be stored in a -20ºC freezer. Note that glycerol with 1.26g/cm3 is much denser than water and, therefore, a 50% (v/v) solution will be very different from a 50% (w/v) solution. For example, restriction enzymes are typically stored in a glycerol/water solution.

Ficoll

Ficoll™ is a sucrose polymer commonly used to adjust the density and viscosity of solutions in the lab. It is employed in density gradients and loading buffers.

Use

One of the most common applications for Ficoll is as a density gradient for the isolation of eukaryotic cells, organelles, bacterial cells, and pancreatic islets.

Properties

Ficoll is an uncharged, highly branched polymer formed by the co-polymerisation of sucrose and epichlorohydrin. Due to its many hydroxyl groups, Ficoll is highly soluble in water. Densities of up to 1.2 g/ml can be attained.

Stability & storage

Since Ficoll does not contain any ionized groups, the molecule does not readily react under physiological conditions. It is stable in alkaline and neutral solutions, but is rapidly hydrolysed in solution at pH 3, especially at high temperature. However, Ficoll can be autoclaving at 110ºC for 30 minutes in neutral solutions. Strong oxidizing and reducing agents should be avoided. Shipping and storage are at ambient temperatures.

DNA dyes

DNA dyes stain deoxyribonucleic acid for laboratory purposes such as detection and quantification. Many DNA dyes also bind to RNA and could be more broadly described as nucleic acid stains. Common dyes included ethidium bromide (EtBr), esp. for agarose gel electrophoresis of DNA, and DAPI for staining the cell nucleus in fluorescent microscopy.

Dyes for DNA in agarose gel electrophoresis

  • Ethidium bromide : traditional DNA stain.
  • SYBR Gold : 10x more sensitive than EtBr
  • SYBR Green : 20x more sensitive than EtBr
  • SYBR Safe : sensitivity similar to EtBr

DNTPs

dNTP's or deoxynucleosides are the monomers that DNA polymerase uses to form DNA.

ProteinA Sepharose Beads

P3391 ProteinA-Sepharose beads from Staphylococcus aureus. Lyophilized powder

Tween 20

Tween (full name Tween 20 or Polysorbate 20) is a common detergent used in biology used in cell lysis and membrane protein solubilisation among other uses.

Triton X-100

Triton (full name Triton X-100) is a common, nonionic detergent used for solubilisation, permeabilisation, and lysis.

SDS

SDS (sodium dodecyl sulfate/sulphate) is an anionic detergent effective in both acidic and alkaline solutions. SDS has a wide variety of applications, but is most often used in protein and lipid solubilisation.

Warnings

  • highly flammable
  • harmful in contact with skin and if swallowed
  • irritating to eyes, respiratory system and skin