<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>Data with Rough Attributes and Its Reduct Analysis&nbsp;&nbsp;</p>
<p>Prem Kumar Singh1,*</p>
<p>1Department of Computer Science and Engineering,</p>
<p>Gandhi Institute of Technology and Management-Visakhapatnam,</p>
<p>Andhra Pradesh 530045, India</p>
<p>* Correspondence: premsingh.csjm@gmail.com , premsingh.csjm@yahoo.com</p>
<p>ORCID: 0000-0003-1465-6572</p>
<p>&nbsp;</p>
<p>Abstract: Recent time many researchers focused on dealing the uncertainty and its characterization. The precise approximation of uncertainty in many-valued data set is one of the major tasks. It becomes more difficult in case the given data sets are non-Euclidean. Hence the rough fuzzy set and its graphical visualization is introduced in this paper for knowledge processing tasks.</p>
<p>Keywords: Fuzzy Rough graph; Knowledge representation; Many-valued attributes; Non-Euclidean geometry; Rough Set; Rough graph</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>1. Introduction</p>
<p>The uncertainty and its approximation is considered as one of the major tasks for soft computing researchers [1-2]. It become more crucial while dealing the data with non-Euclidean [3-4] or cubic set [5].&nbsp; To deal with this issue rough set and its properties is introduced by Pawlak [6-7]. The rough set given a way to approximate the given data sets based on its lower and upper approximation. Due to which the properties of rough set is applied in various fields for multi-decision process [8-11] as well as its graphical visualization [12-16]. This gave a way to characterize the uncertainty in three-way decision space [17-19]. In this process, a problem is addressed while dealing the data with rough attributes and its reduct. To solve this problem current paper focused on illustrating the data with rough attributes, its contextual representation and reducts.&nbsp;</p>
<p>&nbsp;</p>
<p>The motive is to characterize them based on lower, upper and boundary regions as shown in Figure 1. The objective is to provide a basic understanding for new researchers for dealing the data with rough attributes.&nbsp;</p>
<p>&nbsp;</p>
<p>Figure 1: The motivation of this paper and its objective</p>
<p>Rest part of the paper is organized as follows: Section 2 provides background Fuzzy and Rough set. Section 3 contains the proposed method for characterization of rough context and its fuzzy membership-values with an illustrative example in Section 4. Section 5 contains conclusions followed by acknowledgements and references.&nbsp;&nbsp;</p>
<p>2. Background&nbsp;&nbsp;</p>
<p>This section provides the basic background to represent the data with rough attributes and its set approximation for decision making process.</p>
<p>Information System</p>
<p>&nbsp;The Table 1 represents the data with information system where row represents the a set of non-empty objects {O1,O2,..O6}, the columns represents the attributes (A) with defined multi-valued information (R) in the given&nbsp; universe (U). In this way it provides an information system with tuple of 4-attributesS = (U,R,V,f). It can be also represented as S∶= (U,A), where A is non-empty set of attributes set such that for each R^1&sube;A where R = (C&cup;D) i.e. subsets of conditional (C) and decision attributes (D). Table 1 represents following as conditional i.e. C = {A1,A2,A3} and decision attributes i.e. D= {A4}, where V_(i )is the set of values of i^th attribute i.e. A_1:= {yes,yes,yes,no,no,no},f∶ R&rarr;Vis a description or information objective function. These data can be analyzed using the indiscernible relation and its set approximation.&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;Table 1: The data with Rough Attribute and its contextual representation&nbsp;</p>
<p>Objects Attributes Decision Flue (A4)</p>
<p>Temperature(A1)Headache(A2)Muscles pain(A3)</p>
<p>O1 normal yes yes no</p>
<p>O2 high yes yes yes</p>
<p>O3 Very-high yes yes yes</p>
<p>O4 normal no yes no</p>
<p>O5 high no no no</p>
<p>O6 Very-high no yes yes</p>
<p>&nbsp;</p>
<p>Indiscernible Relation</p>
<p>&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;The associated equivalence relation on universe (U) for a given nonempty subset of attribute set with any R^1&sube;R is defined as 〖IND〗_S (R^1)∶= {(x,y) ϵU^2&nbsp; | &forall;_(rϵR^1 ) (r_((x))= r_((y))} , where (x,y) ϵ〖IND〗_S (R^1)&nbsp; are defined as object x and y are inducible by attribute of from R^1. The equivalence class of R^1- indiscernible relations are denoted as 〖[x]〗_(R^1 ). The pair of (U,〖IND〗_S (R^1) ), called estimated space. As for example: The set consists of nonempty subset of attributes &ldquo;Headache&rdquo; and &ldquo;Muscle pain&rdquo;&nbsp; i.e., A_1 and A_2.IND(A_1,A_2)∶= {{O_1,O_2,O_3},{O_4,O_6},{O_5}} containing three indiscernible sets also called elementary sets, one definable set {O_1,O_2,O_3,O_5}.Similarly, the other possible non-empty indiscernible subsets of C are as follows:</p>
<p>IND(A_1 ),IND(A_2 ),IND(A_3 ),IND(A_1,A_2,A_3 ),IND(A_1,A_3),IND(A_1,A_2),IND(A_2,A_3).</p>
<p>In this way the given information system can be defined based on approximating the set.&nbsp;</p>
<p>Set Approximations</p>
<p>It can be observed that the equivalence relations induce a partitioning of universe(U), can be used to create a new subset that are more often of interest have the same values for decision attribute(D).&nbsp; Let R^1&sube;R be a desired subset of U. The description for R^1&nbsp; is desired when we can determine the membership status of each object in U w.r.t R^1, if the 〖[x]〗_(R^1 ) containing partial overlaps with any of the indiscernible defined for an object with an ambiguity. Such an object may not be distinguished, therefore the description of R^1 is defined in-terms of lower (P_* (R^1)), upper (P^* (R^1)) approximation sets respectively also called as positive (POS), negative (NEG) and boundary regions (BND) as follows:</p>
<p>P_* (R^1) = POS (R^1) = {xϵU | [x] &sube;R^1}, where [x]denotes the equivalence-class of x. &hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;(i) &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;</p>
<p>P^* (R^1) = NEG (R^1) = {xϵU | [x] &cap;R^1&nbsp; &ne;&Theta;}&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;.&hellip;(ii)</p>
<p>BND (R^1) =P^* (R^1) -&nbsp; P_* (R^1) &hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;(iii)</p>
<p>&nbsp;</p>
<p>Figure 2:&nbsp; The rough set theory approximations of Table 1&nbsp;</p>
<p>A set R^1 for which P_* (R^1) =P^* (R^1)&nbsp; is called as &ldquo;exact set&rdquo; otherwise rough-set w.r.t P. If an object x &isin;P_* (R^1), then it belongs to target-set&nbsp; R^1 certainly. For any target or decision attribute subset D &sub;Uand conditional attributes C &sub;R, D is obtained as roughset when P_* (Y) &ne;P^* (Y).The roughness of set D w.r.t&nbsp; C is identified as follows: P_C (Y) = 1 -&nbsp; (|P_* (Y)|&nbsp; )/(|P^* (Y) |), where Y &ne;ϕ (if Y = ϕ, then P_C (Y) = 0); |.|&nbsp; denote the cardinality essence of a set. Similarly, correctness is defined as &alpha;_C (Y) =&nbsp; (|P_* (Y)|&nbsp; )/(|P^* (Y) |), then apparently 〖0 &le;&alpha;〗_C (Y) &le;1. If &alpha;_C (Y)= 1,then Y is said to be "CRISP " w.r.t C,&alpha;_C (Y)&lt; 1 then it is "ROUGH". If an object, x &isin;P^* (R^1 ), it cannot be determined whether it belongs to the target or not. If an object, x &isin;BND(R^1), then it does not belong to target-set R^1certainly. A set is said to be &ldquo;ROUGH&rdquo;, if it's&nbsp; BND (R^1) &ne;ϕ, otherwise the set is &ldquo;CRISP&rdquo;. As for example, the objects O_2&nbsp; and O_5 can not be distinguished (i.e indiscernible) from anyone of the attributes shown in Table 1. Hence, the objects present in BND (R^1) region is {O_2,O_5}, which can not be classified properly based on knowledge O_2 and O_5 as shown in Figure 2. It shows that O_2&amp;O_5are boundary line cases. The remaining objects in lower and upper regions as follows:</p>
<p>P_* (Flu = "yes") = {O_1,〖O_3,O〗_6},P_* (Flu = "no") = {O_4} ,BND(R^1) = {O_2,O_5} &hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;(iv)</p>
<p>P^* (Flu = "yes" )= {O_1,O_2,O_3,O_5,O_6 }, &hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;(v)</p>
<p>P^* (Flu = "no") ={O_2,O_4,O_5},BND(R^1) = {O_2,O_5}&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;(vi)</p>
<p>In this way, the set of approximation and its rough membership can be defined. To achieve this goal, a step by step method is illustrated in the next section.&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>The Rough Membership, Core and Reduct Analysis</p>
<p>It can be observed that, the data with rough attributes can be approximated based on lower, upper and boundary regions. The problem is how to characterize them in a membership function. To resolve this issue step by step demonstration is discussed in this section as given below:&nbsp;</p>
<p>Defining the Rough Membership Functions</p>
<p>&nbsp; &nbsp;The set approximations can be defined based on the degree of overlapping regions between the {X}-set and the equivalence membership relation R_((X)), to which the object x belong to a set or not,&nbsp; it is defined using the membership function shown below:&nbsp;&nbsp;</p>
<p>&nbsp;</p>
<p>Figure 3: The characterization of rough-attributes as Membership Functions</p>
<p>&mu;_( x)^R ∶ U&nbsp; &rarr; ≼0,1≻, i.e., function accepts only the values 1 and 0 respectively, where &mu;_( X)^R&nbsp; (x)&nbsp; =&nbsp; (|X &cap; R_((x)) |)/(|&nbsp; R_((x))&nbsp; &nbsp;|)and |.| called the cardinality essence of an attribute (X).&nbsp; The meaning of rough -membership function indicates the assumptions and boundary regions of a set (X) is defined as below equations and its diagrammatic representations are shown in Figure 3.</p>
<p>R_( *)&nbsp; (X)&nbsp; = { x ϵ&nbsp; U ∶〖&mu; 〗_( X)^R&nbsp; (x)= 1 }&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;</p>
<p>&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;(vii)</p>
<p>R^( *)&nbsp; (X)&nbsp; = { x ϵ&nbsp; U ∶〖&mu; 〗_( X)^R&nbsp; (x) &gt;= 0 }</p>
<p>&hellip;&hellip;&hellip;.&hellip;&hellip;&hellip;&hellip;(viii)</p>
<p>R_( *)&nbsp; (X)&nbsp; = { x ϵ&nbsp; U ∶〖0 &lt; &mu; 〗_( X)^R&nbsp; (x) &lt; 1 }</p>
<p>&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;(ix)</p>
<p>Dependency of Decision System Attributes</p>
<p>The major issue with the Decision System is to identify same or indiscernible-objects that may appear several times, due to this the attributes of (C &cup;D) leads to superfluous for most of the Machine Learning Classifiers to design an effective Classification Model. Finding dependency and removal of such attributes may not degrade the performance of classification models. The decision system with A_iattributes totally depends on predicted attribute setD, and its relation called as A_i&nbsp; &rarr;D, if all the values in attribute A_i are uniquely identified (classify) by the values of Di.e., A_idepends on D, there exists functional dependency. In more general, the concept discusses about the partial dependencies of attributes i.e., only some set of A_i values are classifying the values of decision attribute(D). The RST introduce a degree of dependency measure to calculate dependency between two subset of attributes (A_i,D &sube;R) is denoted as &lambda;_(A_i ) (D). It is defined as shown below :</p>
<p>&lambda;_(A_i ) (D) = (card (〖POS〗_(A_i ) (D)))/(card (U)), where 〖POS〗_(A_i ) (D) = &cup;_(X ϵ U/ IND(D)) 〖A_i〗_* (X) &hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;&hellip;(x)</p>
<p>The set〖 POS〗_(A_i ) (D), positive region containing possible elements of Uthat can be uniquely distinguished from the partition&nbsp; U/(IND(D)) byA_i. The objects of &lambda;_(A_i ) (D)represents fraction of total no. of objects in the universe (U)that can be properly classified the elements of decision attributeD. If A_itotally depends on,D then &lambda;_D (A_i) = 1; else &lambda;_D (A_i)&lt; 1.&nbsp;</p>
<p>For better understanding the concept from above table (), the dependency of FLU (A_4)on Temperature(A_3), we observe that the values of (A_3)uniquely identifies some values of decision attribute(A_4), i.e., (A_3, very high)&rArr;(A_4,yes), similarly〖(A〗_3,normal)&rArr;(A_4,no), but(A_3,high)&ne;(A_4,yes), hence there exist partial dependence between&nbsp; A_3 andA_4. To determine &lambda;_(A_3 ) (A_4 )&nbsp; using above equation as shown below:</p>
<p>U = {O_1,O_2,O_3,O_4,O_5,O_6} and U/(IND(A_4))= {{O_1,O_2,O_3,O_6},{O_4,O_5}}</p>
<p>〖POS〗_(A_3 ) (A_4) = {O_3,O_6}&cup;{O_4} = {O_3,O_4,O_6} ,&nbsp;</p>
<p>Thus&lambda;_(A_3 ) (A_4) = 3/6 = 0.5. Similarly, &lambda;_(A_1 ) (A_4) = 0 and &lambda;_(A_2 ) (A_4) = 0.</p>
<p>Accuracy Approximation</p>
<p>For a given real time decision systemS = (U,R,V,f), for any target variable subset X &sube;U and its attribute subsetA &sube;R, the roughness of set X w.r.t A about the classification model can be defined as below Eq. (3.8).</p>
<p>P_A (X) = 1 -&nbsp; (|R_( *)&nbsp; (X)|)/(〖|R〗^( *)&nbsp; (X)|)&nbsp; &nbsp; , obviously 〖0 ⪯P〗_A⪯1,when X &ne;ϕ; if X = ϕ,then P_A (X) = 0; if P_A (X) = 1,then X is said to be &ldquo;CRISP&rdquo; w.r.t A; similarly when P_A (X) &lt; 1,then X is called &ldquo;ROUGH&rdquo; w.r.t A.</p>
<p>Reducts</p>
<p>One often a raises the question, how to remove irrelevant or redundant/superfluous attributes from a decision system by preserving its basic intrinsic properties including appropriate representation space for the learning system. RST allows identifying equivalence or in-discernible class relations, finds a minimal attribute subset that differentiate the entire classes of decision-attribute without deteriorating the performance of the classification model or towards decision making applications. There are several such minimal attribute subsets called &ldquo;REDUCTS&rdquo; of the original set which retain the accuracy like the original set, and thus reduce the computational time.</p>
<p>Core</p>
<p>The set of conditional attributes〖(A〗_i) are unreliable in T, denoted as CORE(A_i) , such that CORE(A_i) = &cap;RED(A_i)&nbsp; i.e intersection of all 〖relative〗_reducts is termed as 〖relative〗_core, each object of the core belongs to some reduct with an important minimal subset of attribute set, and further none of its objects could be excluded.</p>
<p>For example table 3.1, have two possible reducts i.e., 〖RED〗_1= {A_3,A_1 }&nbsp; and 〖RED〗_2={A_3,A_2} w.r.t decision attribute〖 A〗_4, the intersection (core) of the decision Table 1 is〖 A〗_3. Table 2 and Table 3 represents the minimal decision tables of 〖RED〗_1&nbsp; and〖 RED〗_2. In this way the rough provides a way to deal with multi-valued data for decision making process.&nbsp;</p>
<p>&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Table 2: The RED1 for data with rough attributes shown in Table 1</p>
<p>Objects Attributes Decision Flue (A4)</p>
<p>Temperature(A1)Headache(A3)</p>
<p>O1 normal yes no</p>
<p>O2 high yes yes</p>
<p>O3 Very-high yes yes</p>
<p>O4 normal no no</p>
<p>O5 high no no</p>
<p>O6 Very-high no yes</p>
<p>&nbsp;</p>
<p>&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Table 3: The RED2 for data with rough attributes shown in Table 1</p>
<p>Objects Attributes Decision Flue (A4)</p>
<p>Temperature(A1)Muscles pain(A3)</p>
<p>O1 normal yes no</p>
<p>O2 high yes yes</p>
<p>O3 Very-high yes yes</p>
<p>O4 normal yes no</p>
<p>O5 high no no</p>
<p>O6 Very-high yes yes</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>In this way, the core and reduct of given rough context can be investigated. However, the characterization of rough attributes and its visualization is another issues. The author will try to focus on this issue in near future for knowledge processing tasks.</p>
<p>4. Conclusions&nbsp;</p>
<p>This paper introduces step by step method for dealing data with rough attributes, its approximation as well as rough membership function. The core reduct is also illustrated with an example. In near future the author will focus on defining the fuzzy rough membership and its graphical visualization for knowledge processing tasks.&nbsp;&nbsp;</p>
<p>Acknowledgements: Author thanks the editorial team for the valuable time.&nbsp;</p>
<p>Funding :Author declares that, there is no funding for this paper.&nbsp;</p>
<p>Conflicts of Interest: Author declares that, there is no conflict of interest.</p>
<p>Ethics approval: This article does not contain any studies with human or animals participants.</p>
<p>References</p>
<p>[1] Singh P. K., &ldquo;Three-way fuzzy concept lattice representation using neutrosophic set&rdquo;, International Journal of Machine Learning and Cybernetics,&nbsp; Vol 8, Issue 1, pp. 69-79, 2017.</p>
<p>&nbsp;[2] Singh PK, Ch. Aswani Kumar, &ldquo;Concept lattice reduction using different subset of attributes as information granules&rdquo;, Granular Computing, Vol. 2, Issue 3), pp. 159&ndash;173, 2017&nbsp;</p>
<p>&nbsp;[3] Singh PK, &ldquo;AntiGeometry and NeutroGeometry characterization of Non-Euclidean data sets&rdquo;, Journal of Neutrosophic and Fuzzy Systems, Vol 1, Issue 1, pp. 24-33, DOI: https://doi.org/10.54216/JNFS.0101012</p>
<p>[4] Singh PK, &ldquo;Data with Non-Euclidean Geometry and its Characterization,&rdquo; Journal of Artificial Intelligence and Technology, 2021, Vol. 2, Issue 1, pp-3-8., DOI: 10.37965/jait.2021.12001&nbsp;</p>
<p>[5] Singh PK, &ldquo;Cubic graph representation of concept lattice and its decomposition&rdquo;, Evolving System, doi: 10.1007/s12530-021-09400-6&nbsp;</p>
<p>[6] Pawlak Z, &ldquo;Rough sets&rdquo;, Int. J. Comput. Inf. Sci. Vol., pp. 341&ndash;356, 1982</p>
<p>[7] Pawlak Z, &ldquo;Rough set theory and its applications to data analysis,&rdquo; Cybern Syst Vol. 29, Issue 7, pp. 661&ndash;688, 1998&nbsp;</p>
<p>[8] He T, Chan Y, Shi K, &ldquo;Weighted rough graph and its application,&rdquo; In: Proceedings of IEEE Sixth Int Conf Intell Syst Des Appl 1:486&ndash;492, 2006</p>
<p>[9] He T, &ldquo;Rough properties of rough graph,&rdquo; Appl Mech Mater Vol 157&ndash;158, pp. 517&ndash;520, 2012</p>
<p>[10] He T, &ldquo;Representation form of rough graph,&rdquo; Appl Mech Mater, Vol. 157&ndash;158, pp. 874&ndash;877, 2012&nbsp;</p>
<p>[11] Liang M, Liang B, Wei L, Xu X, &ldquo;Edge rough graph and its application,&rdquo; In: Proc. Of eighth International Conference on Fuzzy Systems and Knowledge Discovery 2011, pp. 335&ndash;338&nbsp;</p>
<p>[12] Wang S, Zhu Q, Zhu W, Min F, &ldquo;Graph and matrix approaches to rough sets through matroids. Information Sciences, Vol. 288, pp. 1&ndash;11, 2014&nbsp;</p>
<p>[13] Li W, Huang Z, Jia X, Cai X, &ldquo;Neighborhood based decision-theoretic rough set models,&rdquo; International Journal of Approximate Reasoning, Vol. 69, pp. 1&ndash;17, 2016</p>
<p>[14] Noor R, Irshad I, Javaid I, &ldquo;Soft rough graphs&rdquo;. arXiv preprint arXiv:1707.05837, 2017</p>
<p>[15] Fariha Z, Akram M, &ldquo;A novel decision&ndash;making method based on rough fuzzy information,&rdquo; Int J Fuzzy Syst Vol. 20, Issue 3, pp. 1000&ndash;1014, 2018</p>
<p>[16] Rehman N, Shah N, Ali MI, Park C, &ldquo;Uncertainty measurement for neighborhood based soft covering rough graphs with applications,&rdquo; RACSAM, Vol. 113, pp. 2515&ndash;2535, 2019</p>
<p>[17] Mathew B, John SJ, Garg H, &ldquo;Vertex rough graphs,&rdquo; Complex Intell. Syst. Vol 6, pp. 347&ndash; 353, 2020&nbsp;</p>
<p>[18] Yao YY, &ldquo;Relational interpretations of neighborhood operators and rough set approximation operators,&rdquo; Inf. Sci., Vol. 101, pp. 239&ndash;259, 1998&nbsp;</p>
<p>[19] Yao YY, &ldquo;Three-way decisions with probabilistic rough sets,&rdquo; Inf. Sci., Vol. 180, pp. 341&ndash;353, 2010</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>