EXTRA BITS: SGML HTML XML - Computerphile

แชร์
ฝัง
  • เผยแพร่เมื่อ 1 พ.ย. 2024

ความคิดเห็น • 96

  • @johanneslade2830
    @johanneslade2830 8 ปีที่แล้ว +94

    please do more historic stuff like this. It is very interesting to hear this sort of thing from some one who was actually there. Also it is easier to grasp certain concepts when you know the story how they came to be. thank you very much for sharing

  • @DIECARS1
    @DIECARS1 8 ปีที่แล้ว +18

    You've just got to love this professor

  • @ThatNateGuy
    @ThatNateGuy 8 ปีที่แล้ว +5

    I could listen to Prof. Brailsford speak for hours.

  • @GilesBathgate
    @GilesBathgate 8 ปีที่แล้ว +39

    +Computerphile Apple pie is an application of apples, not a subset! Bramley apples are a subset of apples. Bramley apple pie is an application of bramley apples!

  • @zissou6928
    @zissou6928 8 ปีที่แล้ว +109

    Proof it's possible to still be cool at 80

    • @11Kralle
      @11Kralle 8 ปีที่แล้ว +2

      +9enius It always was. And it always was possible to be dull at 17 as well!

    • @zissou6928
      @zissou6928 8 ปีที่แล้ว +1

      ProfDaveB

  • @taneliharkonen2463
    @taneliharkonen2463 6 ปีที่แล้ว +2

    Wow! this series cleaned a lot of things! :) I've noted the similarities in xml html and svg syntax and allways wondered whats with xml, html, xhtml and now html5. Thank you so much! :D

  • @computertutoring
    @computertutoring 3 ปีที่แล้ว

    Nice to hear someone who was actually there at the start 👍🏽

  • @ChunkyChest
    @ChunkyChest 8 ปีที่แล้ว +16

    can we get some history of regular expressions please

  • @reddcube
    @reddcube 8 ปีที่แล้ว +5

    Does SGML have a generic closing bracket? ie to close any leaf and go to its parent, no matter the leaf's type.

  •  8 ปีที่แล้ว +6

    Great video :-) It would be very interesting if you did similar series on TeX!

  • @otakuribo
    @otakuribo 8 ปีที่แล้ว +11

    I have been tinkering with SVG in Inkscape for years, when Professor Brailsford just mentioned off the cuff that the standard has Adobe PDF origins.
    May I suggest SVG as a future topic? :D

  • @loftyblond
    @loftyblond 8 ปีที่แล้ว +3

    These markup videos are wonderful. However, to my mind, HTML is aptly named while the naming of SGML and XML is confusing. I used to work with both SGML and XML, and the thing we always had to stress when teaching was that XML and its predecessor were not actual markup languages, because languages have words (predefined tags) and grammar (DTD).

  • @MarkBTomlinson
    @MarkBTomlinson 5 ปีที่แล้ว +1

    I was studying Computer Science at Cardiff in 93 and remember well the first time we saw HTML and had to write our first Webpage, little did we know where the funny little language would end up.

    • @invisibledave
      @invisibledave 3 ปีที่แล้ว +1

      I was in college at that time for comp sci too but they didn't know what HTML was at that time. It was just something I ran into later on after getting a degree and you had to teach yourself by looking at existing pages online (since nothing was complicated back then) and copying their code and then playing with tags to see what happened.
      Never heard of SGML and I had never heard of XML until recent years.

    • @MarkBTomlinson
      @MarkBTomlinson 3 ปีที่แล้ว

      @@invisibledave I think we were lucky they made a big thing about it being something new and interesting. Thinking back it was pretty basic stuff though.

  • @JLSoftware
    @JLSoftware 8 ปีที่แล้ว +2

    Dang - I've watched all of these now and I still haven't heard him actually define any of these.

  • @arbazna
    @arbazna 8 ปีที่แล้ว +21

  • @IshayuG
    @IshayuG 8 ปีที่แล้ว +1

    What about a video on XSD? Think that could be pretty interesting as an extension to this video as Brailsford has now told us that there are tens of thousands of XML based tagsets. How do we work with and define these?
    Well, XSD is an XML Schema Definition - it is itself XML that defines a tagset and maps it to data types, essentially defining an XML-based data format.
    You can then send XSD's off to people and then they can put it into programs and then, when that program receives XML based on that XSD, their program knows exactly how to parse it and give meaningful data to the programmer.
    This is really extensively used for web services. For example, if you have Visual Studio, you can import a "service reference" which uses an XSD.
    So much more can be said. Please do it! :D

    • @noxabellus
      @noxabellus 8 ปีที่แล้ว

      wow yes I really hope they talk about this. I was unaware of such but it seems a key component of the whole picture

  • @IkeFoxbrush
    @IkeFoxbrush 3 ปีที่แล้ว

    "If all the end tags were there, you could do a brackets matching operation, in a sense a bit like in a compiler where you match open curly brace with close curly brace. You could tick everything off and say 'That is a tree'. I may not understand what all the tags mean, but boy they match alright."
    Incidentially, that's strikingly similar to how I approach any advanced math formula XD

  • @mikeklaene4359
    @mikeklaene4359 7 ปีที่แล้ว

    I love it! There will always be a better way to do something - and MY way will always be better than YOUR way!
    What I really liked was that pages that would display quite well under Netscape would not on Internet Explorer and vice-versa. Microsoft, staying true to their nature, always had non-standard ways of doing things.

  • @dealloc
    @dealloc 8 ปีที่แล้ว +1

    And you still have separate communities fighting for how the spec should be-take a look at WhatWG (founded by individuals from the main browser vendors), which W3C tend to copy.

  • @okibre
    @okibre 4 ปีที่แล้ว +1

    What would SGML people say about Json/Bson?

  • @salasart
    @salasart 8 ปีที่แล้ว +3

    man, I love this professor =)

    • @UntouchedWagons
      @UntouchedWagons 8 ปีที่แล้ว

      +Luis Salas Lastra Agreed. He is so easy to listen to.

  • @ahmidahmid9303
    @ahmidahmid9303 5 ปีที่แล้ว

    thanks prof you are the best

  • @EthanLR
    @EthanLR 4 ปีที่แล้ว

    Now what is Professor Brailsford's opinion on JSON?

  • @lethargogpeterson4083
    @lethargogpeterson4083 5 ปีที่แล้ว

    I would be curious to hear Prof. Brailsford's thoughts on WhatWG and the reasons HTML5 was not created purely as XHTML5.

    • @lawrencedoliveiro9104
      @lawrencedoliveiro9104 2 ปีที่แล้ว

      XHTML was a failure. It was basically HTML reformulated with the strictness of XML syntax, which nobody could be bothered to follow -- not for web pages, anyway.

  • @VorpalGun
    @VorpalGun 8 ปีที่แล้ว +1

    What about HTML5? It isn't XHTML 5 after all, so presumably W3C gave up on the strict subset version.

    • @martygentillon2786
      @martygentillon2786 7 ปีที่แล้ว +1

      Only sortof, after all, they are also maintaining XHTML5. This is the same as the basic parity between HTML4 and XHTML4. XHTML just doesn't seem likely to replace HTML anymore.

  • @Bob_Burton
    @Bob_Burton 8 ปีที่แล้ว +8

    I don't understand why you have to be tolerant of missing end tags. Just because some people hand code it does not relieve them from responsibility for doing it correctly.

    • @fnorgen
      @fnorgen 8 ปีที่แล้ว +20

      +UKHeliBob Because a browser that does not accept missing end tags would be unable to properly display loads of poorly written websites, and most users would blame the browser and switch to a more forgiving one. It's similar to voice to text programs. Users gravitate towards ones that understand poor pronunciation.
      Missing end tags were allowed at one point, so now we're stuck with it sadly.

    • @profdaveb6384
      @profdaveb6384 8 ปีที่แล้ว +10

      +fnorgen
      Exactly!

    • @ZipplyZane
      @ZipplyZane 8 ปีที่แล้ว +2

      +UKHeliBob They explained it already. If you don't tolerate errors, then people will just not use your software, and will go to someone who will.
      A whole lot of computing is based on the idea of error correction. And it makes a ton of sense to extend this, when possible, to anything we error-prone humans make. Especially since humans design the computers and thus can have errors even in the computer-made stuff.
      Trying to force humans to not make errors is a losing proposition. It would only limit those who would use your stuff. A huge reason for the growth of the web is how error tolerant it is.

    • @chsxtian
      @chsxtian 8 ปีที่แล้ว

      +UKHeliBob It's also because of compatibility. Not everyone was a fan of XHTML, and If you were to open a website that was designed for 'normal' HTML, then end tags could be ommitted.

    • @tamasdemjen4242
      @tamasdemjen4242 8 ปีที่แล้ว +1

      +UKHeliBob It depends how you look at it. HTML can be as simple as a few paragraphs of text, or as complicated as a word processor or a PDF editor. Most designers aren't seasoned engineers, not to mention simple bloggers. Imagine if you had to run a compiler just to post a message on Facebook. I do agree, however, that if you're developing apps, you should follow extremely strict rules. Then HTML is no longer just a document, but rather a whole platform, the browser is your virtual machine, and JavaScript is your assembly / machine code. It doesn't mean you have to program in HTML, you can compile your application into HTML, a special form of executable.

  • @lethargogpeterson4083
    @lethargogpeterson4083 5 ปีที่แล้ว +1

    Interesting. I never knew SVG was related to PDF.

  • @network_king
    @network_king 8 ปีที่แล้ว

    I think the newer MS office files are somehow XML based. I know in office 03 it was just like a .doc was a word specific file. The newer versions use .docx which is the SML version. I think they did that so that things like open office, etc can open them simpler.

    • @shadowace112
      @shadowace112 8 ปีที่แล้ว +5

      +jtech0 a .docx file is actually a zip file that contains XML files. You can check it out yourself by renaming the file from a .docs to a .zip and then opening it with WinRAR, 7-Zip, or whatever you feel like.

  • @EdgyNumber1
    @EdgyNumber1 8 ปีที่แล้ว

    so hang on a sec, was VRML a subset of XML then?
    In fact, it would be interesting to see a video about VRML and what actually happened to it.

  • @BeenOrange
    @BeenOrange 8 ปีที่แล้ว

    you forgot a link in the description!

  • @josephfox9221
    @josephfox9221 4 ปีที่แล้ว

    so wait im confused is HTML a subset of SGML?

  • @jessemckeown5628
    @jessemckeown5628 8 ปีที่แล้ว

    DTDs seem to be written in "angle bracket" notation; can DTD be an application of SGML? Or XML-DTD an application of XML?

    • @boenrobot
      @boenrobot 8 ปีที่แล้ว

      +jesse mckeown No. DTD is standard across tag sets, making it part of the meta language, and not the application. Besides, the notation isn't "the" angle bracket notation... the syntax rules in DTD are slightly different.

  • @b.bergeron8769
    @b.bergeron8769 5 ปีที่แล้ว

    I have a book that say HTML is an extension of SGML... Is it technically wrong?

  • @kinxofsepluv
    @kinxofsepluv 8 ปีที่แล้ว

    I'm curious now that HTML5 will be considered an ML now that web components will be available.

  • @shadowace112
    @shadowace112 8 ปีที่แล้ว +4

    JSON FOR LIFE!!!!!

  • @sofia.eris.bauhaus
    @sofia.eris.bauhaus 8 ปีที่แล้ว +10

    and with html5 they decided that xhtml is basically irrelevant. :(

    • @ZipplyZane
      @ZipplyZane 8 ปีที่แล้ว +5

      +sofias. orange It kinda already was, since browsers had to handle both HTML and XHTML. I mean, XHTML with error tolerance is basically just HTML with error tolerance.
      Plus, who needs to verify (X)HTML anymore? Browsers now just try to display what they can, and fail when they fail. The push in browsers is to be as fast as possible, and verification of any kind takes time.
      Developers verify their markup using separate programs, rather than relying on the browser to do it.
      Having two different standards for every HTML version is just taxing on browsers, so they went for the one that can easily contain the other. HTML5 will understand XHTML just fine.

    • @ZipplyZane
      @ZipplyZane 8 ปีที่แล้ว +2

      sofias. orange I don't know this for sure, but I think that it's not any faster. It just assumes properly formatted code, and as long as that is true, the speed is fine. Only badly formatted code is slower.
      I think what happened is that differences got smaller, to the point where they were mostly stylistic. The only thing I am aware of is that you don't have to includes the slash on self-closing tags (though you can if you want.) Technically, HTML5 says you should close your tag, even if it still works when you don't.
      Well, that and that HTML5 still isn't case sensitive. But, as long as you test for the correct case first, that's not really a problem.

    • @tiagorodrigues3730
      @tiagorodrigues3730 4 ปีที่แล้ว

      Curiously enough, XHTML still lives on in the JSF container ecosystem, which is quite large in corporate environments even in 2020. So not _quite_ irrelevant, though one might make an argument that it should. I won't, but one might.

  • @whatthefunction9140
    @whatthefunction9140 8 ปีที่แล้ว +1

    and now we have html5 and we have forgotten about all this.

  • @insu_na
    @insu_na 8 ปีที่แล้ว +1

    I always close all of my tags. And then my HTML minifier throws the omittable ones in the bin again. ;)

  • @lawrencedoliveiro9104
    @lawrencedoliveiro9104 2 ปีที่แล้ว

    7:25 Unfortunately, Sir Tim was seen by many to have sold out his soul over the Encrypted Media Extensions saga.

  • @normannormiemates4844
    @normannormiemates4844 8 ปีที่แล้ว

    Interesting video.... ML

  • @Springwight
    @Springwight 7 ปีที่แล้ว +2

    Of course, now we have HTML5 which is not an application of either SGML or XML, but at least has its own spec.

  • @jkreitlow
    @jkreitlow 8 ปีที่แล้ว

    Wow, he really seems to like ripping into Dreamweaver.

    • @superdau
      @superdau 8 ปีที่แล้ว

      +John Kreitlow
      I don't really see him "ripping into Dreamweaver", merely mentioning it since it was one of the most popular website creation programs at the time. Anyway ripping into it would be fully justified, because the code it created was atrocious.

  • @sillyfly9530
    @sillyfly9530 8 ปีที่แล้ว +2

    Yet to this very day html5 is not xml-compliant, as for example you can't have empty `script` or `span` tags.

    • @belst_
      @belst_ 8 ปีที่แล้ว +3

      +sillyfly html5 is not xhtml.

    • @sillyfly9530
      @sillyfly9530 8 ปีที่แล้ว +1

      ***** Which is mind-boggling. You'd have thought they'd learned by now.

    • @ScarfmonsterWR
      @ScarfmonsterWR 8 ปีที่แล้ว +4

      +sillyfly They did learn. They focused on html5 because there was no point in pushing something people didn't want to use.

  • @darz_k.
    @darz_k. 4 ปีที่แล้ว

    ..oh yes.. it's a different day from the other videos..
    ..wink wink

  • @BEP0
    @BEP0 8 ปีที่แล้ว

    Nice.

  • @trackbit9hopugneru660
    @trackbit9hopugneru660 8 ปีที่แล้ว

    Huge :P hat off 5:08

  • @MattyTeare
    @MattyTeare 8 ปีที่แล้ว

    Discuss omitted php end tags

    • @sofia.eris.bauhaus
      @sofia.eris.bauhaus 8 ปีที่แล้ว +6

      +MattyTeare i would suggest ommitting php altogether. :P

    • @MattyTeare
      @MattyTeare 8 ปีที่แล้ว

      +sofias. orange tell that to FB

  • @justice4g
    @justice4g 8 ปีที่แล้ว +4

    I just wish people wouldn't use this format for everything these days..
    Y

    • @justice4g
      @justice4g 8 ปีที่แล้ว

      Reckless Roges
      I've had to process a lot of customer data in my job, most is CSV or tab delimited but sometimes we get XML which is less readable for determining programming requirements, not nearly as straight forward to work with and also takes a lot longer to transfer and process as the files are like 50 times larger than they would otherwise need to be. Sometimes XML just isn't the right format for the job.

  • @firstnamelastname-oy7es
    @firstnamelastname-oy7es 8 ปีที่แล้ว +4

    Simple Graphics,
    Miserable Language.

  • @jasonborden9104
    @jasonborden9104 8 ปีที่แล้ว

    Unfortunately the bits about HTML are not correct. HTML is and has never been based on XML. There was a side project that to make an XML-compliant HTML in XHTML but this has been superseded by HTML5 which is not XML-compliant. With HTML5 the parsing became standardized, tooling became better, and with the power of modern CPUs the drawbacks of an XML syntax outweigh the benefits and it will almost certainly never be XML-based. There IS XHTML5, but this is just an equivalent XML serialization of HTML5, and rarely used.

    • @profdaveb6384
      @profdaveb6384 8 ปีที่แล้ว +6

      +Jason Borden
      I did *NOT* say that HTML was "based on XML" (see your 2nd sentence) !!!!
      I was at great pains to say that HTML was intended to be an *application* of SGML but it was difficult to make it so because T B-L did not write a DTD
      The most useful long-term outcome of the "pub in the evening" discussions was XML itself which has been an enabling metasyntax for thousands of applications. However as you say, converting HTML into XHTML to make it be an application of XML was not the panacea (!) that many people thought it would be.

    • @martygentillon2786
      @martygentillon2786 7 ปีที่แล้ว

      Throughout the early oughts, XHTML4 was expected to replace HTML. However that never quite took.

  • @trackbit9hopugneru660
    @trackbit9hopugneru660 8 ปีที่แล้ว

    Huge :P at 5:08

  • @lnplum
    @lnplum 8 ปีที่แล้ว +2

    Joke's on you. WHAT WG HTML is no longer defined as an application of XML *or* SGML and instead brings its own parsing rules making it a language in its own right. So while XHTML and HTML 4 weren't "languages", WHAT WG HTML very much is. Though I prefer calling it a format to avoid these kinds of pedantic discussions.

  • @vkillion
    @vkillion 8 ปีที่แล้ว +6

    You should, just to confuse everyone, talk about ML, the functional programming language. It also ends in "ML" but has nothing to do with any of the "-MLs" we've talked about in these videos.

    • @GilesBathgate
      @GilesBathgate 8 ปีที่แล้ว +1

      +Vincent Killion Perhaps we should talk about objective camels.

    • @DavidVaughan00
      @DavidVaughan00 8 ปีที่แล้ว +1

      +Vincent Killion Talk about Machine Learning and MatLab in the same video

    • @Gooberpatrol66
      @Gooberpatrol66 7 ปีที่แล้ว

      He should also talk about Marxist-Leninists, to confuse people if he is even talking about computers anymore.

  • @andrewzuo86
    @andrewzuo86 8 ปีที่แล้ว

    You need to close paragraph tags? Huh.

    • @Elesario
      @Elesario 8 ปีที่แล้ว

      +andrew zuo Just write ;P

    • @recklessroges
      @recklessroges 8 ปีที่แล้ว

      +andrew zuo In HTML you SHOULD close paragraph tags. (Unless you are short of disk space and have space CPU cycles.) In XML you MUST close tags.

    • @Bunny99s
      @Bunny99s 8 ปีที่แล้ว

      +Jason Hardman That would make even less sense. If you write you're defining an empty tag, so the paragraph is closed before your actual text. So your text is actually between your paragraphs and not inside them. That means any style information tied to the paragraph-tag wouldn't affect the text.

    • @Elesario
      @Elesario 8 ปีที่แล้ว

      +Bunny83 Work from the assumption that I know exactly what a well formed HTML tag looks like and look at what I wrote again.

    • @Elesario
      @Elesario 8 ปีที่แล้ว

      +Error Error

  • @trackbit9hopugneru660
    @trackbit9hopugneru660 8 ปีที่แล้ว

    ! Huge :P at off 5:08

  • @watchoutformeplz
    @watchoutformeplz 5 ปีที่แล้ว

    PaRa

  • @girvix
    @girvix 8 ปีที่แล้ว

    second

  • @ingvarson185
    @ingvarson185 8 ปีที่แล้ว

    First