{"id":65,"date":"2022-05-16T20:52:28","date_gmt":"2022-05-16T19:52:28","guid":{"rendered":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/?page_id=65"},"modified":"2022-05-16T20:58:59","modified_gmt":"2022-05-16T19:58:59","slug":"text","status":"publish","type":"page","link":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/text\/","title":{"rendered":"Text"},"content":{"rendered":"<p><span data-contrast=\"none\">Characters can also be represented in binary. Characters are usually grouped together in what is known as a<\/span><b><span data-contrast=\"none\">\u00a0character set<\/span><\/b><span data-contrast=\"none\">. A character set is more than just the letters that form a language.<\/span><\/p>\n<p><span data-contrast=\"none\">Characters include alphanumeric data (letters and numbers), symbols (*, &amp;, : etc.) and control characters (Shift, Escape etc.).<\/span><\/p>\n<p><span data-contrast=\"auto\">Each character is represented by a unique\u00a0<\/span><b><span data-contrast=\"none\">binary code<\/span><\/b><span data-contrast=\"auto\">.<\/span><\/p>\n<h2 style=\"color: red\">ASCII Code<\/h2>\n<p><span data-contrast=\"none\"><strong>Standard ASCII<\/strong> uses 7 bits to represent each character <\/span><\/p>\n<p><span data-contrast=\"none\">from\u00a0<\/span><b><span data-contrast=\"none\">0000000<\/span><\/b><span data-contrast=\"none\">\u00a0to\u00a0<\/span><b><span data-contrast=\"none\">1111111\u00a0<\/span><\/b><span data-contrast=\"none\">which can be used to store\u00a0<\/span><b><span data-contrast=\"none\">128<\/span><\/b><span data-contrast=\"none\">\u00a0characters.<\/span><\/p>\n<p><b><span data-contrast=\"none\">The codes 0 &#8211; 31 represent control characters which are special non-printing characters<\/span><\/b><\/p>\n<p><b><span data-contrast=\"none\">e.g TAB and Return.<\/span><\/b><\/p>\n<p><b><span data-contrast=\"auto\">Extended ASCII<\/span><\/b><span data-contrast=\"auto\">, which was created when IBM designed its PCs, uses 8 bits to represent each character.<\/span><\/p>\n<p><span data-contrast=\"auto\">from\u00a0<\/span><b><span data-contrast=\"auto\">0000000<\/span><\/b><span data-contrast=\"auto\">\u00a0to\u00a0<\/span><b><span data-contrast=\"auto\">1111111<\/span><\/b><\/p>\n<p><span data-contrast=\"auto\">which can be used to store\u00a0<\/span><b><span data-contrast=\"auto\">256<\/span><\/b><span data-contrast=\"auto\">\u00a0characters.<\/span><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-68\" src=\"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/public\/phshighercomputingscience\/uploads\/sites\/10042\/2022\/05\/16205446\/GetImage-1-300x276.png\" alt=\"\" width=\"424\" height=\"390\" srcset=\"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/public\/phshighercomputingscience\/uploads\/sites\/10042\/2022\/05\/16205446\/GetImage-1-300x276.png 300w, https:\/\/blogs.glowscotland.org.uk\/glowblogs\/public\/phshighercomputingscience\/uploads\/sites\/10042\/2022\/05\/16205446\/GetImage-1.png 363w\" sizes=\"auto, (max-width: 424px) 100vw, 424px\" \/><\/p>\n<h2 style=\"color: red\"><strong>Unicode<\/strong><\/h2>\n<p><span data-contrast=\"auto\">The main issue with using ASCII or Extended ASCII is that 128 or 256 characters limit the amount of character sets that can be held. Representing the character sets for several different language structures is not possible in ASCII, there are just not enough available characters.<\/span><\/p>\n<p><span data-contrast=\"auto\">Unicode was created to overcome this issue. Unicode uses 16 bits to represent each character. This means that Unicode is capable of representing 65,536 different characters and a much wider range of character sets.<\/span><\/p>\n<ul style=\"font-weight: 400\">\n<li data-aria-posinset=\"1\" data-aria-level=\"1\"><span data-contrast=\"auto\">Unicode can represent <strong>65,536 charaters<\/strong><\/span><strong>\u00a0<\/strong><\/li>\n<li data-aria-posinset=\"2\" data-aria-level=\"1\"><span data-contrast=\"auto\">Unicode uses <strong>16 bits<\/strong> to represent each character<\/span><\/li>\n<li data-aria-posinset=\"3\" data-aria-level=\"1\"><span data-contrast=\"auto\">Unicode can represent a greater range of character sets than ASCII<\/span><\/li>\n<li data-aria-posinset=\"4\" data-aria-level=\"1\"><span data-contrast=\"auto\">There are adapted forms of the original Unicode standard capable of representing millions of characters<\/span><\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-69\" src=\"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/public\/phshighercomputingscience\/uploads\/sites\/10042\/2022\/05\/16205600\/GetImage-2-300x153.png\" alt=\"\" width=\"537\" height=\"274\" srcset=\"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/public\/phshighercomputingscience\/uploads\/sites\/10042\/2022\/05\/16205600\/GetImage-2-300x153.png 300w, https:\/\/blogs.glowscotland.org.uk\/glowblogs\/public\/phshighercomputingscience\/uploads\/sites\/10042\/2022\/05\/16205600\/GetImage-2.png 311w\" sizes=\"auto, (max-width: 537px) 100vw, 537px\" \/><\/p>\n<h2 style=\"color: red\"><strong>Advantages and Disadvantages<\/strong><\/h2>\n<ul>\n<li><span class=\"TextRun SCXO246367013 BCX8\" lang=\"EN-GB\" xml:lang=\"EN-GB\" data-contrast=\"auto\"><span class=\"NormalTextRun SCXO246367013 BCX8\">Due to a much larger character set, additional language characters could be identified<\/span><\/span><span class=\"EOP SCXO246367013 BCX8\">\u00a0<\/span><\/li>\n<li><span class=\"TextRun SCXO57995727 BCX8\" lang=\"EN-GB\" xml:lang=\"EN-GB\" data-contrast=\"auto\"><span class=\"NormalTextRun SCXO57995727 BCX8\">The first 128 code were identical to the original ASCII codes so compatibility with ASCII was maintained<\/span><\/span><span class=\"EOP SCXO57995727 BCX8\">\u00a0<\/span><\/li>\n<li><span class=\"TextRun SCXO46902283 BCX8\" lang=\"EN-GB\" xml:lang=\"EN-GB\" data-contrast=\"auto\"><span class=\"NormalTextRun SCXO46902283 BCX8\">Because it was a 16-bit code, each character took double amount of memory to store as ASCII so file sixes increased as did transmission times.<\/span><\/span><span class=\"TextRun SCXO46902283 BCX8\" lang=\"EN-GB\" xml:lang=\"EN-GB\" data-contrast=\"none\"><span class=\"NormalTextRun SCXO46902283 BCX8\">\u00a0<\/span><\/span><span class=\"EOP SCXO46902283 BCX8\">\u00a0<\/span><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Characters can also be represented in binary. Characters are usually grouped together in what is known as a\u00a0character set. A character set is more than just the letters that form a language. Characters include alphanumeric data (letters and numbers), symbols (*, &amp;, : etc.) and control characters (Shift, Escape etc.). Each character is represented by [&hellip;]<\/p>\n","protected":false},"author":79984,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-65","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/wp-json\/wp\/v2\/pages\/65","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/wp-json\/wp\/v2\/users\/79984"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/wp-json\/wp\/v2\/comments?post=65"}],"version-history":[{"count":3,"href":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/wp-json\/wp\/v2\/pages\/65\/revisions"}],"predecessor-version":[{"id":70,"href":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/wp-json\/wp\/v2\/pages\/65\/revisions\/70"}],"wp:attachment":[{"href":"https:\/\/blogs.glowscotland.org.uk\/glowblogs\/phshighercomputingscience\/wp-json\/wp\/v2\/media?parent=65"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}