Things_You_Should_Know.txt 6.1 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149
  1. OMeta/JS is a new version of OMeta, a language for pattern-directed metaprogramming first described in
  2. Alessandro Warth and Ian Piumarta, "OMeta: An Object-Oriented Language for Pattern-Matching," in Proceedings of the Dynamic
  3. Languages Symposium, 2007. (Available at http://www.cs.ucla.edu/~awarth/papers/dls07.pdf)
  4. This page contains the information necessary for someone who has read the OMeta paper to be able to use OMeta/JS.
  5. Pattern Syntax
  6. --------------
  7. +------------------------------------------------------------------------+
  8. | "kind of thing" OMeta OMeta/JS |
  9. +------------------------------------------------------------------------+
  10. | boolean true true |
  11. | number 123 123 |
  12. | character 'x' 'x' | (see note #1)
  13. | string "foo" 'foo' |
  14. | `foo |
  15. | #foo |
  16. | atom foo N/A |
  17. | rule application <expr> expr |
  18. | <r x y> r(x, y) | (see note #3)
  19. | <super stmt> ^stmt | (see note #4)
  20. | list ("hello" 42 answer ()) ['hello' 42 `answer []] |
  21. | negation ~'x' ~'x' |
  22. | look-ahead ~~'x' ~~'x' |
  23. | &'x' |
  24. | semantic predicate ?(> x y) ?(x > y) | (see note #2)
  25. | semantic action => (+ x y) -> (x + y) |
  26. | !(+ x y) !(x + y) |
  27. | binding <expr>:x expr:x | (in OMeta/JS, spaces are not allowed before the colon)
  28. | :x | (this is shorthand for "anything:x")
  29. +------------------------------------------------------------------------+
  30. Note #1: There is no such thing as a character in JavaScript. Even though the language lets you access each "character" of a string via indexing, e.g, "foo"[0], the answer is not a character, but rather a string of length 1.
  31. Note #2: In the version of OMeta described in the paper, semantic actions and predicates were written in COLA (kind of a mix between Scheme and Smalltalk). In OMeta/JS, they are written in JavaScript. More specifically, they are either primary expressions, e.g.,
  32. 123
  33. x
  34. foo.bar()
  35. new Person()
  36. (x + y) // note that you need parentheses around "x + y" in order to make it into a primary expression
  37. or something I made up called "statement expressions", which have the form
  38. "{" <statement>* <expr> "}"
  39. For example,
  40. { x += 2; y = "foo"; f(x) }
  41. The value of a statement expression is equal to that of its last expression.
  42. Note #3: The arguments you pass to a rule don't have to be statement expressions - they can be any JavaScript expression.
  43. Note #4: In OMeta/JS, "super" is just like any other rule (not a special form), so you have to quote the rule name that you pass in as an argument, e.g., both ^r(1, 2) and super("r", 1, 2) are valid super-sends.
  44. A "Handy" New Shorthand
  45. -----------------------
  46. In OMeta/JS, the pattern
  47. "foo"
  48. does not match the string 'foo'; it is instead shorthand for
  49. token('foo')
  50. The Parser grammar provides a definition for token that skips any number of spaces, then tries to match the sequence of characters that was passed to it as an argument. I have used this in many of the example projects, and have found it to be very useful.
  51. Still, there are times when this is not what you want. But that's not a problem, because you can define it to do whatever you want (see the JavaScript Compiler project for an example).
  52. Rules
  53. -----
  54. Here is a parameterized rule taken from the paper, in the original OMeta syntax:
  55. cRange x y ::= <char>:c ?(>= c x)
  56. ?(<= c y) => c;
  57. And here is the same rule rule, in the new OMeta/JS syntax:
  58. cRange :x :y = char:c ?(c >= x)
  59. ?(c <= y) -> c
  60. A couple of (purely syntactic) differences:
  61. (1) rule declarations now use "=" instead of "::=", and
  62. (2) they are no longer terminated with a ";"
  63. A more significant difference has to do with the rule's arguments; note that in the OMeta/JS version, they are preceded by a ':'. This is actually shorthand for
  64. cRange anything:x anything:y = ...
  65. This change has to do with an improvement in the parameter-passing mechanism, which now allows a rule's parameters to be pattern-matched against. (See the paper's "Future Work" section for more details.)
  66. The "=" is actually optional in rule declarations... this, combined with some new syntax that allows a rule to have multiple definitions that are tried in lexicographic order, allows programmers to write rules that have an "ML flavor":
  67. ometa M {
  68. fact 0 -> 1,
  69. fact :n ?(n > 0) fact(n - 1):m -> (n * m)
  70. }
  71. M.match(5, "fact")
  72. Grammar Syntax
  73. --------------
  74. The only change here has to do with rule declarations, which now must be separated by commas:
  75. ometa M {
  76. x = y z,
  77. y = "foo" "bar",
  78. z = "baz"
  79. }
  80. Using Grammars "from the outside"
  81. ----------------------------------
  82. The public interface provided by an OMeta/JS grammar object to the rest of the world consists of two methods:
  83. match(object, ruleName)
  84. and
  85. matchAll(arrayOrStringObject, ruleName)
  86. Here's an example that hopefully explains the difference between the two. The key to understanding it is that a string is just a list of characters.
  87. ometa M <: Parser {
  88. theCharacters = "the" "cat" "sat" "on" "the" "mat",
  89. theWholeString = [theCharacters]
  90. }
  91. input = "the cat sat on the mat"
  92. M.matchAll(input, "theCharacters")
  93. M.match(input, "theWholeString")