I wrote this class to help me do some screen scraping programs. It has two Dictionary objects. One contains the HTML codes and their text equivalents and the other contains the numeric HTML codes and their text equivalents.

Below I listed some of the contents of both dictionaries to give an idea of how I did it. For the complete list please download the zip file from link below.

internal class HTMLCodes
{
    public readonly Dictionary<string, string> Code = 
        new Dictionary<string, string>();
    public readonly Dictionary<string, string> NumericCode = 
        new Dictionary<string, string>();

    public HTMLCodes()
    {
        Code.Add("& ndash;", "-");
        Code.Add("& lsquo;", "`");
        Code.Add("& rsquo;", "'");
        Code.Add("& sbquo;", ",");
        Code.Add("& quot;", "\"");

        NumericCode.Add("& #34;", "\"");
        NumericCode.Add("& #39;", "`");
        NumericCode.Add("& #38;", "&");
        NumericCode.Add("& #60;", "less");
        NumericCode.Add("& #62;", "greater");
    }
}

One more thing, I included all the codes I found. If anyone finds out that I missed one please let me know. Hope you find it useful.