relpipe-data/principles.xml
author František Kučera <franta-hg@frantovo.cz>
Thu, 06 Dec 2018 21:16:49 +0100
branchv_0
changeset 189 47907749817f
parent 188 5b0fab48d59e
child 204 58c40f213028
permissions -rw-r--r--
screenshots: Haiku
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
23
0d2729ed16ed zkouška interního odkazu
František Kučera <franta-hg@frantovo.cz>
parents: 18
diff changeset
     1
<stránka
0d2729ed16ed zkouška interního odkazu
František Kučera <franta-hg@frantovo.cz>
parents: 18
diff changeset
     2
	xmlns="https://trac.frantovo.cz/xml-web-generator/wiki/xmlns/strana"
0d2729ed16ed zkouška interního odkazu
František Kučera <franta-hg@frantovo.cz>
parents: 18
diff changeset
     3
	xmlns:m="https://trac.frantovo.cz/xml-web-generator/wiki/xmlns/makro">
0d2729ed16ed zkouška interního odkazu
František Kučera <franta-hg@frantovo.cz>
parents: 18
diff changeset
     4
	
147
c004a45502b3 new pages: principles, roadmap, faq
František Kučera <franta-hg@frantovo.cz>
parents: 139
diff changeset
     5
	<nadpis>Principles</nadpis>
c004a45502b3 new pages: principles, roadmap, faq
František Kučera <franta-hg@frantovo.cz>
parents: 139
diff changeset
     6
	<perex>Basic ideas, principles and rules behind the Relational pipes</perex>
c004a45502b3 new pages: principles, roadmap, faq
František Kučera <franta-hg@frantovo.cz>
parents: 139
diff changeset
     7
	<pořadí>12</pořadí>
4
1bb39595a51c genrování hlavní nabídky #1
František Kučera <franta-hg@frantovo.cz>
parents: 2
diff changeset
     8
2
ab9099ff88fa vkládání zápatí, jmenné prostory, saxon
František Kučera <franta-hg@frantovo.cz>
parents: 1
diff changeset
     9
	<text xmlns="http://www.w3.org/1999/xhtml">
148
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    10
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    11
		<h2>Sane software</h2>
2
ab9099ff88fa vkládání zápatí, jmenné prostory, saxon
František Kučera <franta-hg@frantovo.cz>
parents: 1
diff changeset
    12
		<p>
148
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    13
			<m:name/> (both the specification and the reference implementation) should be developed according to the <a href="https://sane-software.globalcode.info/">Sane software manifesto</a> (not yet published).
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    14
			Many of principles mentioned below are part of <em>being sane</em>. 
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    15
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    16
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    17
		<h2>Free software and open specification</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    18
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    19
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    20
			<m:name/> is and always will be a <a href="https://www.gnu.org/philosophy/free-sw.html">free software</a> and the specification of the format, tools and libraries will be open.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    21
			It must not be impaired by software patents or other similar restrictions.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    22
			In our country, we do not accept the existence of patents at all.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    23
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    24
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    25
		<h2>Divide and conquer</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    26
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    27
			Each program should do one thing and do it well. We should separate these three tasks:
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    28
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    29
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    30
		<ul>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    31
			<li>data acquisition / creation</li>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    32
			<li>data transformation</li>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    33
			<li>data presentation</li>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    34
		</ul>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    35
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    36
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    37
			A single program should not combine two or more of these tasks. Or should at least allow to run in mode which does only one of them.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    38
			Thus we should be able to combine various programs together and get various presentations of the same data regardless the presentation features of the program that created the data.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    39
			We should be able to add another transformation on the path between the data origin and the data destination. For example filter out some unwanted data or modify or enhance the values.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    40
			Or we should be able to generate some mock/testing data and pass it through the original pipeline (sequence of transformations and the output filter) instead of the live data.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    41
			We should be free in how we combine the tools together.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    42
			We should be able to build even pipelines that was not expected by the authors of particulars tools we used.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    43
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    44
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    45
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    46
			Authors should focus on their task only – e.g. <em>interaction with the Kernel and capturing the inotify events</em> and should not bother about the presentation of the captured data.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    47
			There might be many output formats that makes sense (CSV, XML, table, YAML, \0 separated values etc.),
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    48
			but we should keep it <abbr title="Don't repeat yourself">DRY</abbr> and don't implement every format in every tool.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    49
			It would be a waste of time and also a source of errors, because when developing some additional format (which is not our core business) only <em>by the way</em> we would probably do it wrong. 
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    50
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    51
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    52
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    53
		<h2>Inputs, outputs and transformations as reusable libraries</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    54
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    55
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    56
			Parts of the <m:name/> implementation might be used as a library instead of as a filter in a pipeline.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    57
			This is not a primary purpose of our software, but sometimes it might be useful.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    58
			In such scenario the data are never serialized in the <m:name/> format but flows through a single process and its method/function calls.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    59
			For instance, if we need a tabular or CSV output in our program, we could adopt the code from the <m:name/> implementation as a library and call it internally without generating data in the <m:name/> format.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    60
			This might bring some performance benefits.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    61
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    62
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    63
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    64
			This is not a recommended approach, but should be possible.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    65
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    66
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    67
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    68
			However, in any case, we should provide also an option of producing <em>raw</em> data in the <m:name/> format and allow others to convert it to any other format according their needs.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    69
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    70
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    71
		<h2>Specification-first, contract-first</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    72
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    73
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    74
			The starting point for any developer should be the <m:a href="specification">specification</m:a> that defines the contract and the interface between the system components.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    75
			It should cover the data format and also the tools (inputs, transformers and outputs).
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    76
			The specification must be verified by creating a reference implementation in at least one programming language.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    77
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    78
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    79
		<h2>Small code footprint and modular design</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    80
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    81
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    82
			The length of the program measured in source lines of code (SLOC) should be as small as possible.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    83
			Of course, the goal is not putting multiple statements on a single line.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    84
			We should avoid unnecessary complexity (see <a href="https://en.wikipedia.org/wiki/Cyclomatic_complexity">Cyclomatic complexity</a> – but the SLOC are easier to count and give also quite relevant information).
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    85
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    86
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    87
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    88
			Modular design allows users to include (download, compile, run) only the portions of software they need.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    89
			If the user needs e.g. regular expressions and XML output to be happy, he should not be forced to include also the code for CSV, YAML, JSON and PDF.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    90
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    91
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    92
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    93
			Sane software is minimalistic in this way, which means that it is easy to audit, debug or modify.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    94
			Looking for a bug (or even a backdoor) or looking for the place where to add the new feature
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    95
			is much easier in a software that has hundreds or tousands of SLOC than in a software consisting of hundreds of thousands or even millions of SLOC.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    96
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    97
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    98
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
    99
			The developer who wants to generate (or consume on the other side) relational data, should include only circa few hundreds of SLOC.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   100
			This is the amount of code that could be read through in an hour or two.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   101
			<!--
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   102
			Thus implementing the relational output to an existing program should be matter of few hours.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   103
			-->
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   104
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   105
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   106
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   107
		<h2>Sane dependencies</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   108
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   109
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   110
			The libraries and the tools should not depend on any libraries other than the standard library of given programming language.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   111
			In the best case, of course.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   112
			This might be in coflict with the previous rule and then it is the question what is lesser harm.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   113
			It definitely makes no sense to write e.g. XML or YAML parser ourselves as a part of our tool.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   114
			Using high quality and well tested library is the only sane option.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   115
			But what about XML output? We can develop a reliable XML generator on few lines of code because we can implement only the subset of the standard that we need.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   116
			Writing such code is much more sane than including some bulky library that has several orders of magnitude more lines of code than our program.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   117
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   118
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   119
		<h2>Concise data serialization</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   120
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   121
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   122
			The <m:name/> data format should be concise – the data should be represented by reasonably small amount of bytes.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   123
			The format should support large amounts of small values and also sparse data (structures with many NULL/missing values) without wasting too much space.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   124
			The data that are not written don't need to be compressed and thus have the best compression ratio.
87
25dec6931f18 Lepší odsazení, tabulátory.
František Kučera <franta-hg@frantovo.cz>
parents: 23
diff changeset
   125
		</p>
148
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   126
		
188
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   127
		<h2>Streaming</h2>
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   128
		
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   129
		<p>
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   130
			Relational tools should process streams of data and should hold only necessary data in the memory
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   131
			i.e. the tool should produce the output (the first record) as soon as possible while still reading the input (following records).
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   132
			Thus the memory usage does not depend on the volume of processed data.
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   133
		</p>
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   134
		
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   135
		<p>
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   136
			However, there are cases where such streaming is not feasible e.g. if we need to compute some statistics or a column widths while printing a table in the terminal.
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   137
			In such situation, we must read the whole relation and only then generate the output.
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   138
			But we should still be able to do streaming on the relations level e.i. if there are more relation, we always hold only one of them in the memory.
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   139
		</p>
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   140
		
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   141
		<p>
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   142
			This rule is important not only from the performance point of view but also for user experience.
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   143
			The user should see the output as soon as possible i.e. the longer running processes will produce result continuously instead of flushing everything at the end.
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   144
			This is also good for debugging and <em>looking inside the things</em>. 
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   145
		</p>
5b0fab48d59e principles: streaming
František Kučera <franta-hg@frantovo.cz>
parents: 150
diff changeset
   146
		
148
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   147
		<h2>Unambiguity</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   148
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   149
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   150
			There should be only one way to represent a single value.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   151
			For example the booleans can be written as <code>00</code> (false) or <code>01</code> (true) and every other value (<code>02..FF</code>) should be invalid/unsupported.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   152
			Exceptions might occur if there are relevant reasons, but they should be rare.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   153
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   154
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   155
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   156
		<h2>Multiple files concatenation</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   157
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   158
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   159
			It should be possible to concatenate multiple files or streams of relational data as easy as we can concatenate multiple text files
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   160
			(given that such text files have same character encoding, have no BOM at the beginning and have a newline at the end).
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   161
			If we can do:
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   162
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   163
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   164
		<m:pre jazyk="bash"><![CDATA[
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   165
(cat file-1.txt; echo "some additional middle data"; cat file-2.txt) | wc -l
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   166
]]></m:pre>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   167
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   168
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   169
			We should also be able to do:
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   170
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   171
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   172
		<m:pre jazyk="bash"><![CDATA[
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   173
(cat file-1.rp; relpipe-in-fstab; cat file-2.rp) | relpipe-out-xml
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   174
]]></m:pre>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   175
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   176
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   177
			Also, it should be possible to append (<code>&gt;&gt;</code>) new records to the last relation without modifying the already written data.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   178
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   179
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   180
		<h2>Work primarily with STDIO</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   181
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   182
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   183
			The tools should work primarily and by default with the standard input and standard output (STDIN and STDOUT).
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   184
			Reading/writing from/to files or network should be (if present) a secondary and optional scenario.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   185
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   186
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   187
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   188
			Standard error output (STDERR) should be used for errors/warnings/logs. By default, it should not produce any output, if everything goes well.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   189
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   190
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   191
		<h2>Tools might be TTY-aware</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   192
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   193
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   194
			The input and output tools processing relational data might adapt their behaviour according to the fact whether their input resp. output is a terminal (TTY).
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   195
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   196
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   197
			If the output is a TTY, it means that the output is displayed to the user, 
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   198
			so the tool might e.g. colorize its output or do some other human-friendly formatting – 
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   199
			which makes no sense, if the output is directed to a file or piped to another program.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   200
			Example:
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   201
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   202
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   203
		<m:pre jazyk="bash"><![CDATA[
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   204
# This would print a table with fancy colors using ANSI sequences:
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   205
relpipe-in-fstab | relpipe-out-tabular
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   206
			
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   207
# This would store the same table in a file but without any colors:
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   208
relpipe-in-fstab | relpipe-out-tabular > table.txt]]></m:pre>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   209
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   210
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   211
			If the input is a TTY, it means that the user is typing the values.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   212
			In such situation, the tool might accept another input format (text, human-friendly) or use some default file location instead.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   213
			Example:
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   214
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   215
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   216
		<m:pre jazyk="bash"><![CDATA[
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   217
# This would read the /etc/fstab (which is the default location):
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   218
relpipe-in-fstab | relpipe-out-tabular
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   219
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   220
# Those would read the /etc/mtab instead:
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   221
cat /etc/mtab | relpipe-in-fstab | relpipe-out-tabular
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   222
relpipe-in-fstab < /etc/mtab | relpipe-out-tabular]]></m:pre>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   223
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   224
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   225
			However, the behaviour should be modified in visual and expectable manner only.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   226
			It should not e.g. switch from XML to YAML.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   227
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   228
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   229
		<h2>Use --long-options</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   230
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   231
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   232
			Tools should accept arguments (if any) as <code>--long-options</code>.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   233
			When looking at a script, it should be clear – at first sight – what it does.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   234
			Which would not be if some cryptic short options like <code>-a -x -Z</code> were used.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   235
			In order to save our keyboards, there are features like <em>Bash completion</em>.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   236
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   237
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   238
		
150
7d7d4e1f293f principles: Be exact and reliable
František Kučera <franta-hg@frantovo.cz>
parents: 148
diff changeset
   239
		<h2>Be exact and reliable</h2>
7d7d4e1f293f principles: Be exact and reliable
František Kučera <franta-hg@frantovo.cz>
parents: 148
diff changeset
   240
		
7d7d4e1f293f principles: Be exact and reliable
František Kučera <franta-hg@frantovo.cz>
parents: 148
diff changeset
   241
		<p>
7d7d4e1f293f principles: Be exact and reliable
František Kučera <franta-hg@frantovo.cz>
parents: 148
diff changeset
   242
			<m:name/> should convey data without corrupting or waywardly modifying them.
7d7d4e1f293f principles: Be exact and reliable
František Kučera <franta-hg@frantovo.cz>
parents: 148
diff changeset
   243
			Implementation details (e.g. how values are encoded in the stream) should not affect transferred data and the user.
7d7d4e1f293f principles: Be exact and reliable
František Kučera <franta-hg@frantovo.cz>
parents: 148
diff changeset
   244
		</p>
7d7d4e1f293f principles: Be exact and reliable
František Kučera <franta-hg@frantovo.cz>
parents: 148
diff changeset
   245
		
148
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   246
		<h2>Fail-fast, be strict</h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   247
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   248
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   249
			Because the relational data will be created by machines instead of being manually typed by erring humans,
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   250
			we should fail-fast on an error. We should be strict and require valid inputs only.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   251
			Any error should be revealed as soon as possible and fixed.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   252
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   253
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   254
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   255
			There might be tools or options for recovering corrupted data (caused e.g. by a failing HDD or a faulty network or a buggy software).
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   256
			But the recovery mode is not the default one.
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   257
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   258
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   259
		<p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   260
			We demand reliable systems – not random and accidential behaviour caused by software guessing <em>What might probably these bytes mean?</em>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   261
		</p>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   262
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   263
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   264
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   265
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   266
		
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   267
		<h2></h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   268
		<h2></h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   269
		<h2></h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   270
		<h2></h2>
d51787006954 principles
František Kučera <franta-hg@frantovo.cz>
parents: 147
diff changeset
   271
		
87
25dec6931f18 Lepší odsazení, tabulátory.
František Kučera <franta-hg@frantovo.cz>
parents: 23
diff changeset
   272
	</text>
4
1bb39595a51c genrování hlavní nabídky #1
František Kučera <franta-hg@frantovo.cz>
parents: 2
diff changeset
   273
1
a05c6f3cbc3e základ, první verze
František Kučera <franta-hg@frantovo.cz>
parents:
diff changeset
   274
</stránka>