Date: 2019-05

In this interview, we spoke with computer scientist Muḥend Belqasem about projects concerning the Kabyle language in technology and the digital world.

Aɣmis n Yimaziɣen: Can you give us an overview of the localization projects and how far they have come?

Muḥend Belqasem: Software localization projects (translating user interfaces) are work that never ends. People just need to know that we localize only open-source tools, although we have added a few others that are not free software because many Kabyles use them. However, for the work to move faster, Kabyles need to lend a hand in efforts like this, especially university students studying in the Tamazight departments in Kabylia.

What has been localized into Kabyle so far, and which tools are used the most?

We can give examples of those that are complete: Mozilla's Firefox web browser, the LibreOffice office suite, etc. Others are on the way. Coming back to Firefox, a web browser widely known around the world, more than 200,000 people currently use it in Kabyle, since it is available on Windows, Linux, macOS, Android and iOS. There are others too, such as Evernote, which everyone can use. As for specialized tools, we can mention WinSCP, FileZilla, POEdit, etc. All of these are available in Kabyle. However, the work has not stopped at localization. There are also two projects related to natural language processing and sentence collection: Common Voice and Tatoeba. These are large projects that will bring the Kabyle language a benefit it has never had before.

You recently gave a talk in Ayt Smaɛel on the digital world and localization. How did you find the audience's response?

Kabyle society is not yet accustomed to work of this kind. We still see Kabyle through the lens of tradition. People have grown used to using computers or phones in dominant languages such as French or English. It is as though they do not believe Kabyle can take its place in the digital world. In At Smaɛel, they saw that Kabyle, if its children want it to, can take the path of digital science and the fields around it: artificial intelligence, natural language processing, etc. Now, we should not multiply talks, because what people need is training, so that they can contribute to the development of Kabyle in the digital world, whether through localization, digital content production, linguistic data production, or the development of new tools in Kabyle.

On the occasion of that festival, you were awarded the Kaysa Xalifi prize for localization. A word on this initiative by the cultural association Adrar n Fad…

I am very happy with the prize awarded by the Adrar n Fad association. It is a further encouragement to me. I warmly thank the association for turning its attention to scientific work, because it is the work of the future. Let us not forget to mention Kaysa Xalifi as well, since she was the first woman in Kabyle history to contribute to this field of software localization. She is the one who paved the way for many other women. I also want to thank all the members of the association for the excellent organization of the recent festival, not forgetting the president of the communal people's assembly of At Smaɛel for his deep devotion to Kabyle. We do not have many mayors like him in Kabylia.

How is the Common Voice project progressing?

Let us remind readers of Aɣmis n Yimaziɣen that Common Voice is a Mozilla project for teaching machines to speak human languages. Alongside the Common Voice dataset, there is a companion project: an open-source speech recognition engine named Deep Speech.

Kabyle ranks third in the world, behind English and German, with more than 212 hours of recordings. About 542 Kabyles currently take part, donating their voices by reading the short sentences we display each day.

After "Tamazight", will there be a Kabyle project in Google Translate?

As I said at the start, we prefer work connected to open-source projects. Google Translate is a project of the company Google, and Google is an organization that sells services. As for machine translation, we have already launched a project centred on Kabyle. For now, getting Kabyle into Google Translate is not a priority. The day will come when we look in depth at a project to bring Kabyle into machine translation. As for Tamazight, I see it as a problem from a linguistic standpoint. When we ask for sentences to be translated from English into Tamazight, what will the computer return? Kabyle? Tacawit? Tamẓabit? You know that Tamazight is not a single language. We simply want to stay in line with society as it is.

The localization of Facebook into Kabyle will soon be complete. In your view, when could Kabyle be publicly available there?

Our team has now reached 91% in localizing Facebook into Kabyle. Facebook will be released in Kabyle once it reaches 100%. However, anyone can already switch the Facebook interface to Kabyle even though the translation is not complete. Those who wish can join our localization group to learn how. Our goal is not merely to translate the interface into Kabyle. Later on, we want to use the machine translation feature built into Facebook. To that end, we call on Kabyles to write a great deal in Kabyle on Facebook. We will draw on that data later, when the machine translation work gets under way and we bring Kabyle into the network of the world's languages.

We have seen that you have tried out some algorithms so that machines can understand Kabyle grammar. What is encouraging about them, and what difficulties do you run into?

I have contributed some work on morphosyntactic analysis and grammatical analysis. This is technical work built on the latest ideas in new technologies. It draws on several disciplines, among them linguistics, natural language processing (NLP), mathematics and artificial intelligence. This is what is generally called computational linguistics. The algorithms, which I developed in the Python programming language, can be found on GitHub, and people can use them. They were written so that computers can learn how the morphology and the syntax, or grammar, of Kabyle work. They are based on machine learning, deep learning and reinforcement learning. In future they will open the way to semantic analysis and sentiment analysis, and to many other areas of computational linguistics.

You are calling on university students to join the localization team. How great, in percentage terms, is the shortage of contributors in the projects?

The biggest problem in this work is getting the word out. As you know, we do not work with official organizations. Our work, scientific though it is, remains activism, because it is for the Kabyle language alone that we work. So we have no resources and no funding. We use social networks to share ideas among ourselves. Sometimes we organize talks or training sessions for young people, especially those studying in Tamazight departments. But today we need a great deal of help from our fellow Kabyles to be able to bring Kabyle into certain modern services, such as machine translation. Help exists, but we need more of it.

Your closing words…

I thank Aɣmis n Yimaziɣen for giving us the opportunity to talk about these projects. We also call on universities to consider how to bring projects of this kind into their teaching programmes for the benefit of students of Tamazight. I call on young men and women to join these projects: on the one hand, they will learn new ideas that add to what they are currently learning at university; on the other, they will lend a hand to the teams now working on many projects directly online.

Interview by M. K.