Чи це рішення, якщо мова, описана за кількістю випадків, є регулярною?

Відомо, що мова слів, що містять рівну кількість 0 і 1, не є регулярною, тоді як мова слів, що містять рівну кількість 001 і 100, є регулярною ( див. Тут ).

Враховуючи два слова $w_1,w_2$ , чи вирішується мова, якщо мова слів, що містять рівну кількість і є регулярною? $w_1$ $w_2$

regular-languages undecidability

— sdcvvc
джерело

Чи можете ви навести інші приклади так визначених звичайних мов, крім

та

, або

? Як щодо прикладу алфавіту з 3 символами?

1^{i} 0

$1^i0$

01^{i}

$01^i$

0^{i} 1

$0^i1$

10^{i}

$10^i$

— бабу

Якщо

є суворим підсловом

, є велика ймовірність, що мова порожня, тому регулярна. Я не знаю інших прикладів.

w_{1}

$w_1$

w_{2}

$w_2$

— sdcvvc

Я з легкістю підозрюю, що наведені вище приклади є єдиними, що дозволить вирішити проблему. Якщо ви вкажете лише дві підрядки, я б припустив, що це CF ... залежно від того, що ви можете вказати щодо випадків. Ви не чітко уточнюєте, що маєте на увазі під "описом кількості випадків".

— бабу

Орган питання досить точний ІМО.

— sdcvvc

поки що рішення для особливих випадків залежать від ідеї, що виникнення підрядків

гарантує лише поодинокі випадки втручання

. тому якимось чином припускаючи, що поточні відповіді є правильними [мені ще не зрозуміло], здається, що між

існує певне співвідношення , яке гарантує в середині сканування рядка, що можна бути в будь-якому стані "рівним" або "нерівним" ", але виключається лише максимальним кінцевим числом для" нерівного "випадку.

w_{1}

$w_1$

w_{2}

$w_2$

w_{1}

$w_1$

w_{2}

$w_2$

— vzn

Враховуючи два слова , , чи вирішується, якщо мова слів, що містять рівну кількість і $w_1$ $w_2$ $L$ $w_1$ $w_2$ є регулярною?

Спочатку кілька визначень:
їх можна зробити більш короткими, а позначення можна вдосконалити, якщо вони будуть використані в доказуванні. Це лише перший проект.

Давши два слова і , ми скажемо, що: $w_1$ $w_2$

завжди відбуваєтьсяз , відзначається $w_1$ $w_2$ $w_1\triangleleft w_2$ , iff
1. для будь-якого рядка такого, що з $s$ $s=xw_2y$ і відбувається інше розкладання . Примітка: Умова, що і $\mid x\mid,\, \mid y\mid\ \geq \mid w_1\mid +\mid w_2\mid$ $|x|_0,|x|_1|,|y|_0,|y|_1| \geq 1$ $s=x'w_1y'$
  $x$ $y$ кожен містить щонайменше 0 і 1, необхідний патологічний випадок (знайдений @sdcvvc): , і $w_1=1^i0$ $w_2=v1^{i+j}$ $y\in1^*$ та його симетричні варіанти.
2. є рядок з $s=xw_2y$ таким чином, що існує максимум одне розкладання $\mid x\mid,\, \mid y\mid\ \geq \mid w_1\mid +\mid w_2\mid$ $s=x'w_1y'$
завжди триваєз , відзначено $w_1$ $w_2$ , якщо кожен завжди відбувається з іншим, $w_1\triangleleft \triangleright\,w_2$
і відбуваються незалежно, відзначається $w_1$ $w_2$ , якщо ні один не трапляється з іншим, $w_1\triangleright \triangleleft\,w_2$
завжди відбувається раз або більшеніж , відзначено , якщо для будь-якого рядка такещо с є інші розклади $w_1$ $m$ $w_2$ $w_1\triangleleft_m w_2$ $s$ $s=xw_2y$ $\mid x\mid,\ \mid y\mid|\ \geq \mid w_1\mid +\mid w_2\mid$ $m$ $s=x_iw_1y_i$ для такий, що означає . $i\in[1,m]$ $i\neq j$ $x_i\neq x_j$

Ці визначення побудовані так, що ми можемо ігнорувати те, що відбувається на кінцях рядка, де, як передбачається, відбудуться і . Прикордонні ефекти в кінці рядка потрібно аналізувати окремо, але вони являють собою обмежену кількість випадків (насправді я думаю, що я забув один-два таких крайових підпункти у своєму першому аналізі нижче, але це насправді не має значення). Визначення сумісні з перекриттям подій. $w_1$ $w_2$

Розглянути 4 основні випадки (ігнорування симетрії між і ): $w_1$ $w_2$

Обидва слова обов'язково поєднуються разом, за винятком випадків на кінцях рядка. Це стосується лише пар форми і , або і . Це легко розпізнається за допомогоюкінцевого автомата,який перевіряє наявність одиноких випадків на обох кінцях рядка, який слід розпізнати, щоб переконатися, що є одиночне виникнення на обох кінцях або на жодному з них. Існує також вироджений випадок, коли : тоді мова L, очевидно, регулярна. $w_1\triangleleft \triangleright\,w_2$
$1^i0$ $01^i$ $0^i1$ $10^i$ $w_1=w_2$
$w_1\triangleleft w_2$ , but not $w_2\triangleleft w_1$
One of the 2 words cannot occur without the other, but the converse is not true (except possibly at the ends of the string). This happens when:
- $w_1$ is a substring of $w_2$ :then a finite automaton can just check that $w_1$ does not occur outside an instance of $w_2$ .
- і для деякого слова , : тоді кінцевий автомат перевіряє, як і в попередньому випадку, що не трапляється відокремленим від . Однак автомат дозволяє рахувати один додатковий екземпляр який дозволить прийняти, якщо $w_1=1^i0$ $w_2=v1^j$ $v\in\{0,1\}^*$ $v\neq01^i$ $w_1$ $w_2$ $w_1$ $w_2$ є суфіксом рядка. Є три інші симетричні випадки (1-0 симетрія та ліво-права симетрія).
$w_1\triangleleft_2 w_2$
One of the 2 words occurs twice in the other. That can be recognized by an a finite automation that checks that the smaller word never occurs in the string. The is also a slightly more complex variant that combines the two variations of case 2. In this case the automaton checks that the smaller string $1^i0$ never occurs, except possibly as part of $v$ in the larger one $v1^j$ coming as a suffix of the string (and 3 other cases by symetry).
$w_1\triangleright \triangleleft\,w_2$
The 2 words can occur independently of each other. We build a generalized-sequential-machine (gsm) $G$ that output $a$ when it recognizes an occurrence of $w_1$ and $b$ when recognizing an occurrence of $w_2$ , and forgets everything else. The language $L$ is regular only if the language $G(L)$ is regular. But $G(L)=\{w\in\{a,b\}^*\mid\ \mid w\mid_a=\mid w\mid_b\}$ which is clearly context-free and not regular. Hence $L$ is not regular.
Actually we have $L=G^{-1}(G(L))$ . Since regular languages and context-free languages are closed under gsm mapping and inverse gsm mapping, we know also that $L$ is context free.

One way to organize a formal proof could be the following. First build a PDA that recognizes the language. Actually it can be done with a 1-counter machine, but it is easier to have two stack symbols to avoid duplicating the finite control. Then, for the cases where it should be a FA, show that the counter can be bounded by a constant that depends only on the two words. For the other cases show that the counter can reach any arbitrary value. Of course, the PDA should be organized so that the proofs are easy enough to carry.

Representing the FA as a 2-stack-symbols PDA is probably the simplest representation for it. In the non-regular case, the finite control part of the PDA is the same as that of the GSM in the proof sketch above. Instead of outputting $a$ 's and $b$ 's like the GSM, the PDA counts the difference in number with the stack.

— babou
джерело

I had a question about context-freeness in the case of three words. I deleted it when I realised it could be analyzed similarly. I had first thought that proving non-CFness would make an original exercise, but the GSM ruins it.

— babou

It is not clear what do you mean by "occur independently of each other", "come necessarily together" etc. Please write formal definitions instead, and prove that they cover all cases.

— sdcvvc

I am not sure what you are asking, and what level of formalization you need, for what purpose. I realized that analyzing by hand possible relations of the two words is not garanteed to be correct, and does not matter anyway. What matters is whether an occurence of one word can exist without creating at the same time an occurence (or several) of the other word. The details do not matter as it will always be localized and thus manageable finitely. The two ends do not matter either as tey are localized too. Even overlaps of occurrences do not matter since they can only be finitely many in 1 place

— babou

I asked you about precise definitions of the terms mentioned in the comment. Thank you for writing them. Was I supposed to guess them previously? Anyway, you seem to claim that

0^{i} 1 ◃ ▹ 1 0^{i}

$0^i 1 \triangleleft \triangleright 1 0^i$ . This does not satisfy condition 1. of the definition of "

w_{1}

$w_1$ always occurs with

w_{2}

$w_2$ ", since there is no occurrence of

1 0^{i}

$1 0^i$ in

s = 0^{M} 0^{i} 1 1^{M}

$s=0^M 0^i 1 1^M$ .

— sdcvvc

Sorry, I did not mean to make you guess. It only took me time to understand what exactly you wanted. My failing only. Regarding your counter example, you are correct. But for me it only means that I have to be a little bit more careful about telomeres, in the definition of the relations. I defined them too quickly, but

0^{M}

$0^M$ or

1^{M}

$1^M$ do not convey much information in this context. This is really a boundary pathological example within a pathological case, that actually cannot occur when more than 2 symbols are used. I just do not believe it changes anything.

— babou